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PREFACE 

This introduction to Group Theory, with its emphasis on Lie 
Groups and their application to the study of symmetries of the 
fundamental constituents of matter, has its origin in a one-semester 
course that I taught at Yale University for more than ten years. The 
course was developed for Seniors, and advanced Juniors, majoring 
in the Physical Sciences. The students had generally completed the 
core courses for their majors, and had taken intermediate level 
courses in Linear Algebra, Real and Complex Analysis, Ordinary 
Linear Differential Equations, and some of the Special Functions of 
Physics. Group Theory was not a mathematical requirement for a 
degree in the Physical Sciences. The majority of existing 
undergraduate textbooks on Group Theory and its applications in 
Physics tend to be either highly qualitative or highly mathematical. 
The purpose of this introduction is to steer a middle course that 
provides the student with a sound mathematical basis for studying 
the symmetry properties of the fundamental particles. It is not 
generally appreciated by Physicists that continuous transformation 
groups (Lie Groups) originated in the Theory of Differential 
Equations. The infinitesimal generators of Lie Groups therefore 
have forms that involve differential operators and their commutators, 
and these operators and their algebraic properties have found, and 
continue to find, a natural place in the development of Quantum 
Physics. 

Guilford, CT. 
June, 2000. 


1 
INTRODUCTION 

The notion of geometrical symmetry in Art and in Nature is 
a familiar one. In Modern Physics, this notion has evolved to 
include symmetries of an abstract kind. These new symmetries play 
an essential part in the theories of the microstructure of matter. The 
basic symmetries found in Nature seem to originate in the 
mathematical structure of the laws themselves, laws that govern the 
motions of the galaxies on the one hand and the motions of quarks 
in nucleons on the other. 

In the Newtonian era, the laws of Nature were deduced from 
a small number of imperfect observations by a small number of 
renowned scientists and mathematicians. It was not until the 
Einsteinian era, however, that the significance of the symmetries 
associated with the laws was fully appreciated. The discovery of 
space-time symmetries has led to the widely held belief that the 
laws of Nature can be derived from symmetry, or invariance, 
principles. | Our incomplete knowledge of the fundamental 
interactions means that we are not yet in a position to confirm this 
belief. We therefore use arguments based on empirically 
established laws and restricted symmetry principles to guide us in 
our search for the fundamental symmetries. Frequently, it is 
important to understand why the symmetry of a system is observed 
to be broken. 

In Geometry, an object with a definite shape, size, location, 
and orientation constitutes a state whose symmetry properties, or 
invariants, are to be studied. Any transformation that leaves the 


state unchanged in form is called a symmetry transformation. The 


greater the number of symmetry transformations that a state can 
undergo, the higher its symmetry. If the number of conditions that 
define the state is reduced then the symmetry of the state is 
increased. For example, an object characterized by oblateness alone 
is symmetric under all transformations except a change of shape. 

In describing the symmetry of a state of the most general 
kind (not simply geometric), the algebraic structure of the set of 
symmetry operators must be given; it is not sufficient to give the 
number of operations, and nothing else. The law of combination of 
the operators must be stated. It is the algebraic group that fully 
characterizes the symmetry of the general state. 

The Theory of Groups came about unexpectedly. Galois 
showed that an equation of degree n, where n is an integer greater 
than or equal to five cannot, in general, be solved by algebraic 
means. In the course of this great work, he developed the ideas of 
Lagrange, Ruffini, and Abel and introduced the concept of a group. 
Galois discussed the functional relationships among the roots of an 
equation, and showed that they have symmetries associated with 
them under permutations of the roots. 

The operators that transform one functional relationship 
into another are elements of a set that is characteristic of the 
equation; the set of operators is called the Galois group of the 
equation. 

In the 1850’s, Cayley showed that every finite group is 
isomorphic to a certain permutation group. The geometrical 
symmetries of crystals are described in terms of finite groups. 
These symmetries are discussed in many standard works (see 


bibliography) and therefore, they will not be discussed in this book. 


In the brief period between 1924 and 1928, Quantum 
Mechanics was developed. Almost immediately, it was recognized 
by Weyl, and by Wigner, that certain parts of Group Theory could 
be used as a powerful analytical tool in Quantum Physics. Their 
ideas have been developed over the decades in many areas that 
range from the Theory of Solids to Particle Physics. 

The essential role played by groups that are characterized by 
parameters that vary continuously in a given range was first 
emphasized by Wigner. These groups are known as Lie Groups. 
They have become increasingly important in many branches of 
contemporary physics, particularly Nuclear and Particle Physics. 
Fifty years after Galois had introduced the concept of a group in the 
Theory of Equations, Lie introduced the concept of a continuous 
transformation group in the Theory of Differential Equations. Lie’s 
theory unified many of the disconnected methods of solving 
differential equations that had evolved over a period of two hundred 
years. Infinitesimal unitary transformations play a key role in 
discussions of the fundamental conservation laws of Physics. 

In Classical Dynamics, the invariance of the equations of 
motion of a particle, or system of particles, under the Galilean 
transformation is a basic part of everyday relativity. The search for 
the transformation that leaves Maxwell’s equations of 
Electromagnetism unchanged in form (invariant) under a linear 
transformation of the space-time coordinates, led to the discovery of 
the Lorentz transformation. The fundamental importance of this 


transformation, and its related invariants, cannot be overstated. 


2 
GALOIS GROUPS 
In the early 19th - century, Abel proved that it is not possible to 

solve the general polynomial equation of degree greater than four by 
algebraic means. He attempted to characterize all equations that can 
be solved by radicals. Abel did not solve this fundamental problem. 
The problem was taken up and solved by one of the greatest innovators 
in Mathematics, namely, Galois. 
2.1. Solving cubic equations 

The main ideas of the Galois procedure in the Theory of 
Equations, and their relationship to later developments in Mathematics 
and Physics, can be introduced in a plausible way by considering the 
standard problem of solving a cubic equation. 

Consider solutions of the general cubic equation 

Ax? + 3Bx’ + 3Cx + D=0, 
where A - D are rational constants. 
If the substitution y = Ax + B is made, the equation becomes 

y'+3Hy+G=0 
where 
H=AC-B* 
and 
G = A’D - 3ABC + 2B’. 

The cubic has three real roots if G’ + 4H’ < 0 and two imaginary roots 
if G? + 4H’ >0. (See any standard work on the Theory of Equations). 

If all the roots are real, a trigonometrical method can be used to 
obtain the solutions, as follows: 


the Fourier series of cos*u is 


cos*u = (3/4)cosu + (1/4)cos3u. 

Putting 

y = scosu in the equation y* + 3Hy + G=0 (s>0), 
gives 

cos*u + (3H/s’)cosu + G/s* = 0. 
Comparing the Fourier series with this equation leads to 

s = 2 v(-H) 
and 
cos3u = -4G/s° 
If v is any value of u satisfying cos3u = —4G/s’, the general solution is 
3u = 2nx + 3v, (nis an integer). 
Three different values of cosu are given by 
u=v, and 27/3 + v. 
The three solutions of the given cubic equation are then 
scosv, and scos(2m/3 + v). 
Consider solutions of the equation 
x -3x+1=0. 
In this case, 
H =-1 and G* + 4H’ = -3. 
All the roots are therefore real, and they are given by solving 
cos3u = —4G/s* = -4(1/8) = -1/2 
or, 
3u =cos'(-1/2). 

The values of u are therefore 27/9, 4/9, and 81/9, and the roots are 


X, = 2cos(2m/9), x, = 2cos(4/9), and x; = 2cos(87/9). 


10 


2.2. Symmetries of the roots 
The roots x,, X, and x; exhibit a simple pattern. Relationships 
among them can be readily found by writing them in the complex 
form: 
2cos0 =e’ + e"° where 0 = 27/9, so that 
x, =e" +e", 
a = e278 +4 e728 : 
and 
X;= eff 4 eto. 


Squaring these values gives 


x =x, +2, 
es 
X, =X,+2, 
and 
X; =X, +2. 


The relationships among the roots have the functional form f(x,,x,,x;) 
= (0. Other relationships exist; for example, by considering the sum of 
the roots we find 

X, +X +x,-2=0 

X +X; +x,-2=0, 
and 

X, +x +x,-2=0. 
Transformations from one root to another can be made by doubling- 
the-angle, 0. 

The functional relationships among the roots have an algebraic 

symmetry associated with them under interchanges (substitutions) of 
the roots. If O is the operator that changes f(x,,x,,x;) into f(x,,x3,X,) 


then 
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Of(X,,X),X3) — £(X5,X3,X)), 
O’£(X,,X5,X3) —> £(K;,X,,X,), 
and 
OP £(X,,X5,X3) > £(K,,X5,X3). 
The operator O° = I, is the identity. 
In the present case, 
O(x,’ - x, - 2) = (x? - x, - 2) =0, 
and 
O'(x,’ — x, - 2) =(x,’ — x, - 2) =0. 
2.3. The Galois group of an equation. 

The set of operators {I, O, O°} introduced above, is called the 
Galois group of the equation x* — 3x + 1 = 0. (It will be shown later 
that it is isomorphic to the cyclic group, C;). 

The elements of a Galois group are operators that interchange 
the roots of an equation in such a way that the transformed functional 
relationships are true relationships. For example, if the equation 

X, +X, +x,-2=0 
is valid, then so is 
O(%, +X) +X, -2) =x, +x, +x,-2=0. 
True functional relationships are polynomials with — rational 
coefficients. 
2.4. Algebraic fields 

We now consider the Galois procedure in a more general way. 

An algebraic solution of the general nth - degree polynomial 
a,x" + a,x"! +...a,=0 
is given in terms of the coefficients a; using a finite number of 


operations (+,-.x,+,V). The term "solution by radicals" is sometimes 
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used because the operation of extracting a square root is included in 
the process. If an infinite number of operations is allowed, solutions 
of the general polynomial can be obtained using transcendental 
functions. The coefficients a, necessarily belong to a field which is 
closed under the rational operations. If the field is the set of rational 
numbers, Q, we need to know whether or not the solutions of a given 
equation belong to Q. For example, if 

x°-3=0 
we see that the coefficient -3 belongs to Q, whereas the roots of the 
equation, x, = + V3, do not. It is therefore necessary to extend Q to Q', 
(say) by adjoining numbers of the form av3 to Q, where a is in Q. 

In discussing the cubic equation x* - 3x + 1 = 0 in 2.2, we 
found certain functions of the roots f(x,,x,,x;) = 0 that are symmetric 
under permutations of the roots. The symmetry operators formed the 
Galois group of the equation. 

For a general polynomial: 

x" + ax"! + ax"? + ..a, =0, 
functional relations of the roots are given in terms of the coefficients in 
the standard way 
My PR Ky eet +X, =a, 
XjXy FP RyKy cs RyKy H Roky tk + RR, = Ay 


X1XX3 + XyX3X4 + eee + Xy-2Xn-1 Xn = a3 


X1XX3 susie ares Xy-1Xn = a,. 
Other symmetric functions of the roots can be written in terms 
of these basic symmetric polynomials and, therefore, in terms of the 


coefficients. Rational symmetric functions also can be constructed 
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that involve the roots and the coefficients of a given equation. For 
example, consider the quartic 
x’ + ax’? +a,=0. 

The roots of this equation satisfy the equations 

X, +X, +X,+x,=0 

X\X_ + XjX3 + XXy + XpX3 + X.X4 + XzX4 = a, 

XX oX3 + XjXoX4 + X1X3X4 + X.X3X, =O 

XXp.XzXy4 = ay. 

We can form any rational symmetric expression from these 
basic equations (for example, (3a,° — 2a,)/2a,’ = f(x,,X,,X3,X,)). In 
general, every rational symmetric function that belongs to the field F 
of the coefficients, a,, of a general polynomial equation can be written 
rationally in terms of the coefficients. 

The Galois group, Gal, of an equation associated with a field F 
therefore has the property that if a rational function of the roots of the 
equation is invariant under all permutations of Gal, then it is equal to a 
quantity in FP. 

Whether or not an algebraic equation can be broken down into 
simpler equations is important in the theory of equations. Consider, 
for example, the equation 

x= 3, 
It can be solved by writing x’ = y, y° =3 or 
x= 6/3)". 

To solve the equation, it is necessary to calculate square and 
cube roots — not sixth roots. The equation x° = 3 is said to be 
compound (it can be broken down into simpler equations), whereas x” 


= 3 is said to be atomic. The atomic properties of the Galois group of 
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an equation reveal the atomic nature of the equation, itself. (In 
Chapter 5, it will be seen that a group is atomic ("simple") if it 
contains no proper invariant subgroups). 

The determination of the Galois groups associated with an 
arbitrary polynomial with unknown roots is far from straightforward. 
We can gain some insight into the Galois method, however, by 
studying the group structure of the quartic 

x* + ax’? +a,=0 
with known roots 
X, = ((-a, + )/2)'? , x, =-X,, 
X3 = ((-a, — p)/2)'" , x, =-X;, 
where 
= (a,” = 4a,)"”. 

The field F of the quartic equation contains the rationals Q, and 
the rational expressions formed from the coefficients a, and a,. 

The relations 

X, +X, =xX,+x,=0 
are in the field F. 
Only eight of the 4! possible permutations of the roots leave 


these relations invariant in F; they are 


X, Xo X3 Xy X, X> X3 Xy X, X> X3 Xy 

‘ P, = ? P, = > P; = ? 
X, Xo X3 Xy X, X> X4 Xz Xy X; X3 Xy 
X, Xo Xz Xy X, X> X3 Xy X, X> Xz Xy 

P, a: ’ P; = Py P, = Dy 


Ke Ry Re, Kp ey has Mee Xp 5 Rk 
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XK Ks Ny Ky he Ke 


bw Ome, ape ee Xj .%s KX} 


The set {P,,...P,} is the Galois group of the quartic in F. It is a 
subgroup of the full set of twentyfour permutations. We can form an 
infinite number of true relations among the roots in F. If we extend 
the field F by adjoining irrational expressions of the coefficients, other 
true relations among the roots can be formed in the extended field, F'. 
Consider, for example, the extended field formed by adjoining pw (= 
(a,” — 4a,)) to F so that the relation 
x -x,=pisinF. 
We have met the relations 
X,; =-X, and x, =-x, 
so that 
x Su, and x7 =x. 
Another relation in F' is therefore 
iS Sk 
The permutations that leave these relations true in F' are then 
{Peas Py Paps 

This set is the Galois group of the quartic in F'. It is a subgroup of the 
set {Pj,...P,}. 

If we extend the field F' by adjoining the irrational expression 
((-a, — 1)/2)'” to form the field F" then the relation 

X= k= 2((-ay — )/2)"* isin F". 


This relation is invariant under the two permutations {P,, P;}. 
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This set is the Galois group of the quartic in F". It is a subgroup of the 
set 
1 PrP Pay P 

If, finally, we extend the field F" by adjoining the irrational 

((-a, + u)/2)'” to form the field F" then the relation 

X, — X= 2((—a, — x)/2)'” is in F". 
This relation is invariant under the identity transformation, P, , alone; 
it is the Galois group of the quartic in F"'. 

The full group, and the subgroups, associated with the quartic 
equation are of order 24, 8,4, 2, and 1. (The order of a group is the 
number of distinct elements that it contains). In 5.4, we shall prove 
that the order of a subgroup is always an integral divisor of the order 
of the full group. The order of the full group divided by the order of a 
subgroup is called the index of the subgroup. 

Galois introduced the idea of a normal or invariant subgroup: if 
H is a normal subgroup of G then 

HG - GH = [H, G] =0. 
(H commutes with every element of G, see 5.5). 
Normal subgroups are also called either invariant or self-conjugate 
subgroups. A normal subgroup H is maximal if no other subgroup of 
G contains H. 
2.5. Solvability of polynomial equations 

Galois defined the group of a given polynomial equation to be 
either the symmetric group, S,, or a subgroup of S,, (see 5.6). The 
Galois method therefore involves the following steps: 


1. The determination of the Galois group, Gal, of the equation. 
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2. The choice of a maximal subgroup of H In the above case, 


‘max(1)* 


{P,, ...P,} is a maximal subgroup of Gal = S,. 


3. The choice of a maximal subgroup of H from step 2. 


max(I) 
In the above case, {P,,..P,} = H,,,.2) iS a maximal subgroup of H,,,41)- 
The process is continued until H,,,,, = {P;} = {I}. 

The groups Gal, Hynaxc1)> «max = 1, form a composition series. 
The composition indices are given by the ratios of the successive 
orders of the groups: 

£,/Hqy, byy/hy,, ...bgy/1. 

The composition indices of the symmetric groups S, for n = 2 to 7 are 
found to be: 
Composition Indices 
2 
pie 
pe nS | 
2, 60 
2, 360 


7 2,2520 


Nn nr FBP W NY SB 


We state, without proof, Galois' theorem: a polynomial equation can 
be solved algebraically if and only if its group is solvable. 

Galois defined a solvable group as one in which the 
composition indices are all prime numbers. Furthermore, he showed 
that if n > 4, the sequence of maximal normal subgroups is S,, A,, I, 
where A, is the Alternating Group with (n!)/2 elements. The 
composition indices are then 2 and (n!)/2. For n > 4, however, (n!)/2 
is not prime, therefore the groups S, are not solvable for n > 4. Using 
Galois' Theorem, we see that it is therefore not possible to solve, 


algebraically, a general polynomial equation of degree n > 4. 
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3 
SOME ALGEBRAIC INVARIANTS 
Although algebraic invariants first appeared in the works of 

Lagrange and Gauss in connection with the Theory of Numbers, the 
study of algebraic invariants as an independent branch of Mathematics 
did not begin until the work of Boole in 1841. Before discussing this 
work, it will be convenient to introduce matrix versions of real bilinear 
forms, B, defined by 

B= Diet” Lier’ AX; 
where 

X = [X,,X5,...X,,], an m-vector, 

y = Ly;.Y>,---Y,], an n-vector, 


and a, are real coefficients. The square brackets denote a column 


vector. 
In matrix notation, the bilinear form is 
B=x'Ay 
where a s 
ayy ain 
A= 
Ani Aion 


Ry ay 


The scalar product of two n-vectors is seen to be a special case of a 
bilinear form in which A =I. 
If x = y, the bilinear form becomes a quadratic form, Q: 
Q =x'Ax. 
3.1. Invariants of binary quadratic forms 


Boole began by considering the properties of the binary 
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quadratic form 
Q(x,y) = ax’ + 2hxy + by’ 


under a linear transformation of the coordinates 


x' = Mx 
where 
x =[x,y], 
ij 
x'=[x',y'], and M = 
k 1 


The matrix M transforms an orthogonal coordinate system into an 
oblique coordinate system in which the new x'- axis has a slope (k/i), 


and the new y'- axis has a slope (I/j), as shown: 


[it+j,k+1] 


[0,1] 


[0 0] [ 1 0] xX 


The transformation of a unit square under M. 
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The transformation is linear, therefore the new function 
Q'(x'y') is a binary quadratic: 
Q'\(x',y') = ax"? as 2h'x'y' 4 b'y”. 


The original function can be written 


Q(x,y) = x'Dx 
where 
a h 
D= 
h b 


and the determinant of D is 
detD = ab — h’, is the discriminant of Q. 


The transformed function can be written 


Qix'y) =x"Dx 
where 
ah 
D'= 
h' b' 
and 
detD' = a'b' - h”, the discriminant of Q'. 
Now, 


Q'(x',y') = (Mx)'D'Mx 
= x'M'D'Mx 
and this is equal to Q(x,y) if 
M'DM =D. 
The invariance of the form Q(x,y) under the coordinate transformation 


M therefore leads to the relation 
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(detM)’detD' = detD 
because 
detM" = detM. 
The explicit form of this equation involving determinants is 
(il — jk)’(a'b' — h”) = (ab — h’). 

The discriminant (ab - h’) of Q is said to be an invariant of the 
transformation because it is equal to the discriminant (a'b' — h”) of Q', 
apart from a factor (il — jk)’ that depends on the transformation itself, 
and not on the arguments a,b,h of the function Q. 
3.2. General algebraic invariants 

The study of general algebraic invariants is an important 
branch of Mathematics. 

A binary form in two variables is 

(yk) Sax! Say ee KS 
= y ib eas oe 

If there are three or four variables, we speak of ternary forms or 
quaternary forms. 

A binary form is transformed under the linear transformation 
M as follows 

f(s) = Tox, Ha alk ee, 
The coefficients 
dos Ais ene Ay 5 As ae 4% 
and the roots of the equation 
f(x,,X,) =0 
differ from the roots of the equation 
£(x,'.X,') = 0. 


Any function I(a,,a,,...a,) of the coefficients of f that satisfies 
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jal Eager: Buea: Weed Ea he pee: 

is said to be an invariant of f if the quantity r depends only on the 
transformation matrix M, and not on the coefficients a, of the function 
being transformed. The degree of the invariant is the degree of the 
coefficients, and the exponent w is called the weight. In the example 
discussed above, the degree is two, and the weight is two. 

Any function, C, of the coefficients and the variables of a form 
f that is invariant under the transformation M, except for a 
multiplicative factor that is a power of the discriminant of M, is said to 
be a covariant of f. For binary forms, C therefore satisfies 
FClay aj 468, $e] Xs) S Cacao) 

It is found that the Jacobian of two binary quadratic forms, 
f(x,,x,) and g(x,,x,), namely the determinant 


Of/dx, df/dx, 


dg/dx, dg/dx, 


where 0f/dx, is the partial derivative of f with respect to x, etc., is a 
simultaneous covariant of weight one of the two forms. 
The determinant 


d°f/dx, — d°£/Ax, 0x, 
0°g/0x,0x, 0°g/dx,° 
called the Hessian of the binary form f, is found to be a covariant of 


weight two. A full discussion of the general problem of algebraic 


invariants is outside the scope of this book. The following example 
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will, however, illustrate the method of finding an invariant in a 
particular case. 
Example: 
To show that 
(a,a — a,”)(a,a3 — a”) — (aa; — aa,)"/4 
is an invariant of the binary cubic 
f(x,y) = a,x? + 3a,x’y + 3a,xy’ + ay” 
under a linear transformation of the coordinates. 
The cubic may be written 
f(x,y) = (a,x’+2a,xy+a,y’)x + (a,x°+2a,xytay )y 
= x'Dx 
where 
x=[xyl, 
and 


aX + ayy aX + ay 


aX + ay axXt+a, 


Let x be transformed to x': x' = Mx, where 


ij 
M = 
k ] 
then 
f(x,y) = f(x y') 
if 


D=M'DM. 


Taking determinants, we obtain 
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detD = (detM)’detD', 
an invariant of f(x,y) under the transformation M. 


In this case, D is a function of x and y. To emphasize this 


point, put 

detD = (x,y) 
and 

detD'= o'(x',y') 
so that 


(x.y) = (detM)’p'(x'y') 


= (a,x + ayy)(a,X + ayy) — (a,x + ay)” 


ee: De 
= (a,a) — a)°)x" + (a,a3 — a)a))xy + (a)a; - a,°)y 


= x'Ex, 
where 
(aa) — a,” ) (a,a3; — aa,)/2 
EK= 
(a,a3 — a)ay)/2 (aja; — ay ) 


Also, we have 


o'(xty') = x"E'x' 
= x'M'E'Mx 
therefore 
x' Ex = (detM)’x'M'E'Mx 
so that 


E = (detM)’M'EM. 


Taking determinants, we obtain 
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detE = (detM)‘detE' 
= (a,a) — a;")(aja3 — ay) — (a,a3 - a,a5)"/4 
= invariant of the binary cubic f(x,y) under the 
transformation x' = Mx. 
4 
SOME INVARIANTS OF PHYSICS 
4.1. Galilean invariance. 

Events of finite extension and duration are part of the 
physical world. It will be convenient to introduce the notion of 
ideal events that have neither extension nor duration. Ideal events 
may be represented as mathematical points in a_ space-time 
geometry. A particular event, E, is described by the four 
components [t,x,y,z] where t is the time of the event, and x,y,z, are 
its three spatial coordinates. The time and space coordinates are 
referred to arbitrarily chosen origins. The spatial mesh need not be 
Cartesian. 

Let an event E[t,x], recorded by an observer O at the origin 
of an x-axis, be recorded as the event E'[t',x'] by a second observer 
O', moving at constant speed V along the x-axis. We suppose that 
their clocks are synchronized at t = t' = 0 when they coincide at a 
common origin, x = x'=0. 

At time t, we write the plausible equations 
t=t 
and 
x'=x- Vt, 
where Vt is the distance traveled by O' in a time t. These equations 
can be written 


E' = GE 
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where 


G is the operator of the Galilean transformation. 


The inverse equations are 


t= 
and 

xX =x'+Vt' 
or 

E =G'E' 


where G' is the inverse Galilean operator. (It undoes the effect of 
G). 

If we multiply t and t' by the constants k and k', respectively, 
where k and k' have dimensions of velocity then all terms have 
dimensions of length. 

In space-space, we have the Pythagorean form x’ + y? =r’, 
an invariant under rotations. We are therefore led to ask the 
question: is (kt)? + x’ invariant under the operator G in space-time? 
Calculation gives 

(kt)? + x? = (k't')? +x + 2Vx't' + Vt? 

= (k't')’ +x” only if V=0. 
We see, therefore, that Galilean space-time is not Pythagorean in its 
algebraic form. We note, however, the key role played by 


acceleration in Galilean-Newtonian physics: the velocities of the 


ZF 


events according to O and O' are obtained by differentiating the 
equation x' = —Vt + x with respect to time, giving 
vi=-Vitv, 
a plausible result, based upon our experience. 
Differentiating v' with respect to time gives 
dv'/dt' = a'=dv/dt=a 
where a and a' are the accelerations in the two frames of reference. 
The classical acceleration is invariant under the Galilean 
transformation. If the relationship v' = v — V is used to describe the 
motion of a pulse of light, moving in empty space at v=c = 3 x 10° 
m/s, it does not fit the facts. All studies of ultra high-speed 
particles that emit electromagnetic radiation show that v' = c for all 
values of the relative speed, V. 
4.2. Lorentz invariance and Einstein's space-time symmetry. 

It was Einstein, above all others, who advanced our 
understanding of the true nature of space-time and relative motion. 
We shall see that he made use of a symmetry argument to find the 
changes that must be made to the Galilean transformation if it is to 
account for the relative motion of rapidly moving objects and of 
beams of light. He recognized an inconsistency in the Galilean- 
Newtonian equations, based as they are, on everyday experience. 
We shall restrict the discussion to non-accelerating, inertial frames. 

We have seen that the classical equations relating the events 


E and E' are E' = GE, and the inverse E = G’'E' where 


G = and G'= 
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These equations are connected by the substitution V <> -V; this is 
an algebraic statement of the Newtonian Principle of Relativity. 
Einstein incorporated this principle in his theory. He also retained 
the linearity of the classical equations in the absence of any 
evidence to the contrary — equispaced intervals of time and distance 
in one inertial frame remain equispaced in any other inertial frame. 
He therefore symmetrized the space-time equations as follows: 
t= t-—-—vVx 
x'=-Vx + t 
(The zero in G is replaced by —-V) 
Note, however, the inconsistency in the dimensions of the time- 
equation that has now been introduced: 
t'= t- Vx. 
The term Vx has dimensions of [L]*/[T], and not [T]. This can be 
corrected by introducing the invariant speed of light, c — a postulate 
in Einstein's theory that is consistent with experiment: 
ct'=ct — Vx/c 
All terms in the equation now have dimensions of length. 

Einstein went further, and introduced a dimensionless 
quantity y instead of the scaling factor of unity that appears in the 
Galilean equations of space-time. This factor must be consistent 
with all observations. The equations then become 

ct'= -yct — Byx 
x'=-Byct+ yx, where B=V/c. 
These can be written 
E' = LE, 


where 
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y SRY 
= , and E = [ct, x] 


L is the operator of the Lorentz transformation. 


The inverse equation is 


E =L'E 
where 
y By 
L'= 
By oY 


This is the inverse Lorentz transformation, obtained from L by 
changing 6 — -B (or V —~ -V); it has the effect of undoing the 


transformation L. We can therefore write 


LL =I, or 
y -By y By 1 0 

-By ¥ By ¥ 0 1 

Equating elements gives 
v 7 By a] 
therefore, 
y = INC - 6’) (taking the positive 

root). 


4.3. The invariant interval. 
Previously, it was shown that the space-time of Galileo and 


Newton is not Pythagorean in form. We now ask the question: is 
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Einsteinian space-time Pythagorean in form? Direct calculation 
leads to 
(ct)? + (x =y°(1 + B’\(ct') + 4By*x'ct' 
+y°(1 + B’)x” 
z(ct') +(x') if B>0. 
Note, however, that the difference of squares is an invariant under 
L: 
(ct) — (x) = (et!) - Oy 
because 
Y(L-B’)=1. 
Space-time is said to be pseudo-Euclidean. 
The negative sign that characterizes Lorentz invariance can 
be included in the theory in a general way as follows. 
We introduce two kinds of 4-vectors 
x" = [x°, x’, x’, x*], a contravariant 
vector, 
and 
X, = [Xo. X15 X25 X3], a covariant vector, 
where 
= [kee oe eed 
The scalar product of the vectors is defined as 
MO Sey sk Fe eee ee 
= (x) — (x')Y + OY + OY) 
The event 4-vector is 
E” = [ct, x, y, z] and the covariant form 
is 


E, = [ct,-x,-y,-Z] 


31 


so that the Lorentz invariant scalar product is 
E'"E, = (ct) - (x* + y? +z’). 

The vector x" transforms as x” = Lx” where L is 

( y-By 0 0) 

See vas Oe 


0 0 1 O 
0 0 0 Av 


This is the operator of the Lorentz transformation if the motion of 
O' is along the x-axis of O's frame of reference. 

Important consequences of the Lorentz transformation are 
that intervals of time measured in two different inertial frames are 
not the same but are related by the equation 

At' = yAt 
where At is an interval measured on a clock at rest in O's frame, and 
distances are given by 
Al' = Al/y 
where Al is a length measured on a ruler at rest in O's frame. 
4.4. The energy-momentum invariant. 

A differential time interval, dt, cannot be used in a Lorentz- 
invariant way in kinematics. We must use the proper time 
differential interval, dt, defined by 

(cdt)? — dx’ = (cdt')? — dx” = (cdt)’. 

The Newtonian 3-velocity is 

Vy = [dx/dt, dy/dt, dz/dt], 
and this must be replaced by the 4-velocity 
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V" = [d(ct)/dt, dx/dt, dy/dt, dz/dt] 

= [d(ct)/dt, dx/dt, dy/dt, dz/dt]dt/dt 

= lyc.yvn] . 
The scalar product is then 

VV, = (yey’ = (Vn) 

= (yc)'(1 — (vp/c)’) 

ne; 
(In forming the scalar product, the transpose is understood). 
The magnitude of the 4-velocity is | V“| = c, the invariant speed of 
light. 

In Classical Mechanics, the concept of momentum is important 
because of its role as an invariant in an isolated system. We 
therefore introduce the concept of 4-momentum in Relativistic 
Mechanics in order to find possible Lorentz invariants involving 
this new quantity. The contravariant 4-momentum is defined as: 

Pp" = mv" 
where m is the mass of the particle. (It is a Lorentz scalar, the mass 
measured in the frame in which the particle is at rest). 


The scalar product is 


Pep = (nic); 
Now, 
P* = [myc, myvy] 
therefore, 
P“P, = (myc) - (myvy)’. 
Writing 


M = ym, the relativistic mass 


we obtain 


33 


P“P, = (Mc) - (Mvy)’ = (mc)’. 
Multiplying throughout by c’ gives 
M’c* — M’y,’c? = m’c*. 
The quantity Mc’ has dimensions of energy; we therefore write 
E = Mc’ 
the total energy of a freely moving particle. 
This leads to the fundamental invariant of dynamics 
¢’P*P, = E’ - (pc)? = E” 
where 
E® = mc’ is the rest energy of the 
particle, and p is its relativistic 3-momentum. 
The total energy can be written: 
E=yE°=E°+T, 
where 
T=E'y - 1), 
the relativistic kinetic energy. 
The magnitude of the 4-momentum is a Lorentz invariant 
[P| Sane; 
The 4- momentum transforms as follows: 
PY = LP’. 
For relative motion along the x-axis, this equation is equivalent to 


the equations 


E'= YE - Bycp* 
and 
cp’ =-ByE+ ycp*. 


Using the Planck-Einstein equations E = hv and 
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E = p’c for photons, the energy equation becomes 
v'=yv - Byv 
= yv(1 - B) 
v(1 — B)/(1 - B*y'” 
vil — By + BI". 


This is the relativistic Doppler shift for the frequency v', measured 


in an inertial frame (primed) in terms of the frequency v measured 
in another inertial frame (unprimed). 
4.5. The frequency-wavenumber invariant 
Particle-wave duality, one of the most profound discoveries 

in Physics, has its origins in Lorentz invariance. It was proposed by 
deBroglie in the early 1920's. He used the following argument. 

The displacement of a wave can be written 

y(t,r) = Acos(wt — ker) 
where w = 2mv (the angular frequency), |k| = 2n/A (the 
wavenumber), and 
r = [x, y, z] (the position vector). The phase (wt - ker) can be 
written ((w/c)ct - ker), and this has the form of a Lorentz invariant 
obtained from the 4-vectors 
E“[ct, r], and K"[o/c, k] 

where E” is the event 4-vector, and K” is the "frequency- 
wavenumber" 4-vector. 

deBroglie noted that the 4-momentum P” is connected to the 
event 4-vector E” through the 4-velocity V"“, and the frequency- 
wavenumber 4-vector K” is connected to the event 4-vector E” 


through the Lorentz invariant phase of a wave ((w/c)ct - ker). He 
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therefore proposed that a direct connection must exist between P” 
and K"; it is illustrated in the ae diagram: 


‘Tetr] 


(Einstein) P"P,= inv. E’K,= Inv. (deBroglie) 


“TE/c,p] K“[o/c,k] 


(deBroglie) 


The coupling between P" and K" via E". 


deBroglie proposed that the connection is the simplest possible, 

namely, P“ is proportional to K”. He realized that there could be 
only one value for the constant of proportionality if the Planck- 
Einstein result for photons, E = hw/2z, is but a special case of a 
general result; it must be h/2ma, where h is Planck’s constant. 
Therefore, deBroglie proposed that 

PY «x K* 
or 

PY = (h/20)K". 
Equating the elements of the 4-vectors gives 

E = (h/21)o 
and 


p = (h/2m)k . 
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In these remarkable equations, our notions of particles and 
waves are forever merged. The smallness of the value of Planck's 
constant prevents us from observing the duality directly; however, it 
is clearly observed at the molecular, atomic, nuclear, and particle 
level. 

4.6. deBroglie's invariant. 
The invariant formed from the frequency-wavenumber 4-vector 
is 
K"K,, = (o/c, k)[@/c,-k] 
= (w/c) — k? = (w°/c)’, where w° is 
the proper angular frequency. 
This invariant is the wave version of Einstein's energy- 
momentum invariant; it gives the dispersion relation 
w” =’ — (kc)’. 
The ratio w/k is the phase velocity of the wave, vy. 
For a wave-packet, the group velocity v, is dw/dk; it can be 
obtained by differentiating the dispersion equation as follows: 
wdw — ke*dk = 0 
therefore, 
Vg = dw/dk = kc’/o. 
The deBroglie invariant involving the product of the phase and 
group velocity is therefore 
VV = (/k).(ke’/w) = c’. 
This is the wave-equivalent of Einstein's 
E=Mc’. 
We see that 
VoVg = ¢ = E/M 
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or, 

Vg = E/Mv, = Ek/Mw = p/M = vy, the particle velocity. 
This result played an important part in the development of Wave 
Mechanics. 

We shall find in later chapters, that Lorentz transformations 
form a group, and that invariance principles are related directly to 
symmetry transformations and their associated groups 

5 
GROUPS — CONCRETE AND ABSTRACT 
5.1 Some concrete examples 

The elements of the set {+1, +1}, where i = V1, are the roots 
of the equation x* = 1, the “fourth roots of unity”. They have the 
following special properties: 

1. The product of any two elements of the set (including the 
same two elements) is always an element of the set. (The elements 
obey closure). 

2. The order of combining pairs in the triple product of any 
elements of the set does not matter. (The elements obey associativity). 

3. A unique element of the set exists such that the product of 
any element of the set and the unique element (called the identity) is 
equal to the element itself. (An identity element exists). 

4. For each element of the set, a corresponding element exists 
such that the product of the element and its corresponding element 
(called the inverse) is equal to the identity. (An inverse element 
exists). 

The set of elements {+1, +1} with these four properties is said 


to form a GROUP. 
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Here, the law of composition of the group is multiplication; this 
need not be the case. For example, the set of integers Z = {... -2, -1, 
0,1,2,... } forms a group if the law of composition is addition; in this 
group, the identity element is zero and the inverse of each integer is 
the integer with the same magnitude but with opposite sign. 
In a different vein, we consider the set of 4x4 matrices 
1000 0001 0010) (0100 
{M}= |0100/;,;1000 ,, 0001), 0010). 
0010 0100 1000} 0001 
0001) \0010) \0100) G000 
If the law of composition is matrix multiplication, then {M} is found 
to obey: 
1 -closure, 
2 --associativity, 
and to contain: 
3 --an identity, diag(1, 1,1, 1), 
and 
4 --inverses. 
The set {M} forms a group under matrix multiplication. 
5.2. Abstract groups 
The examples given above illustrate the generality of the group 
concept. In the first example, the group elements are real and 
imaginary numbers, in the second, they are positive and negative 
integers, and in the third, they are matrices that represent linear 
operators (see later discussion). Cayley, in the mid-19th century, first 
emphasized this generality, and he introduced the concept of an 


abstract group G that is a collection of n distinct elements (...g;...) for 


oo 


which a law of composition is given. If n is finite, the group is said to 
be a group of order n. The collection of elements must obey the four 
rules: 
1. Ifg;,g,€ G then g, = g°g, GV g,, g, © G (closure) 
2. 24(g;8;) = (2,8); [omitting the composition symbole] (associativity) 
3. de €G such that g.e = eg; = g, V g, € G (an identity exists) 
4. If g, € G then J g,' € G such that g, 'g, = g9,'=e 
(an inverse exists). 
For finite groups, the group structure is given by listing all 


compositions of pairs of elements in a group table, as follows: 


e.g; g, . <-([st symbol, or operation, in pair) 
e 
Si - SiS Si; 
8 - Si8i 858) 
&k » SSi Bri 


If gg, = gig, V g;, g, © G, then G is said to be a commutative or abelian 
group. The group table of an abelian group is symmetric under 
reflection in the diagonal. 

A group of elements that has the same structure as an abstract 
group 1s a realization of the group. 
5.3 The dihedral group, D, 

The set of operations that leaves an equilateral triangle 
invariant under rotations in the plane about its center, and under 
reflections in the three planes through the vertices, perpendicular to the 


opposite sides, forms a group of six elements. A study of the structure 
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of this group (called the dihedral group, D;) illustrates the typical 
group-theoretical approach. 
The geometric operations that leave the triangle invariant are: 
Rotations about the z-axis (anticlockwise rotations are positive) 
R,O) (123) ~ (123) =e, the identity 
R,(27/3)(123) = (312) =a 
R,(41/3)(123) = (231) =a’ 
and reflections in the planes I, IT, and HI: 
R, (123) = (132) =b 
R,, (123) = G21) =c 
R,, (123) = (213) =d 
This set of operators is D, = {e, a, a’, b,c, d}. 
Positive rotations are in an anticlockwise sense and the inverse 
rotations are in a clockwise sense, so that the inverse of e, a, a’ are 
e'=e,a' =a’, and (a’)' =a. 
The inverses of the reflection operators are the operators themselves: 
b'=b,c'=c,andd' =d. 
The set D, forms a group with the multiplication table: 


eaalbcd 


e eaabcd 
a aaeldbec 


a aeaicdb 


io” 
Co 
Q 
a 
oO 
ia) 
ioe) 
nN 


Q 
QO 
Qa. 
ox 
© 
N 
oO 
rot) 


Q. 
a 
oO 
ie) 
~ 
rat) 
oO 
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In reading the table, we follow the rule that the first operation is 
written on the right: for example, ca’ = b. A feature of the group D, is 
that it can be subdivided into sets of either rotations involving {e, a, 
a’} or reflections involving {b, c,d}. The set {e, a, a’} forms a group 
called the cyclic group of order three, C,;. A group is cyclic if all the 
elements of the group are powers of a single element. The cyclic 
group of order n, C,, is 
C, = {e, a, a’, a°, .....,a7'}, 

where n is the smallest integer such that a" = e, the identity. Since 

atg"* = g"=e, 
an inverse a"“ exists. All cyclic groups are abelian. 

The group D, can be broken down into a part that is a group C;, 
and a part that is the product of one of the remaining elements and the 
elements of C;. For example, we can write 

D, =C,+bC, ,b EC, 

= f{e,a,a°} + {b, ba, ba’} 

= {e,a,a°} + {b,c, d} 

=cC; =dC,, 
This decomposition is a special case of an important theorem known 
as Lagrange’s theorem. (Lagrange had considered permutations of 
roots of equations before Cauchy and Galois). 
5.4 Lagrange’s theorem 

The order m of a subgroup H,,, of a finite group G,, of order n is 
a factor (an integral divisor) of n. 

Let 

G, = {Z)= €, 2, 23, Z,} be a group of order n, and let 
H,,= {h,=e,h,,h,, h,,} be a subgroup of G, of order m. 
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If we take any element g, of G, that is not in H,,, we can form the set 
of elements 

{g.hj, g.ho, gh, ...2¢,h,} = 2H. 
This is called the left coset of H,, with respect to g,. We note the 
important facts that all the elements of g,h,, j=1 to m are distinct, and 
that none of the elements g,h, belongs to H.,,. 

Every element g, that belongs to G, but does not belong to H,, 
belongs to some coset g,H,, so that G, forms the union of H,, and a 
number of distinct (non-overlapping) cosets. (There are (n — m) such 
distinct cosets). Each coset has m different elements and therefore the 
order n of G, is divisible by m, hence n = Km, where the integer K is 
called the index of the subgroup H,, under the group G,. We therefore 
write 

G, = 2:0 + SoH + isn + «SKA 
where 

go EG, EH, 

83 €G, € H,, gH, 


Sax © Gy E An» SH» SisHme Sn, K-1Hm- 

The subscripts 2,3, 4, ..K are the indices of the group. 
As an example, consider the permutations of three objects 1, 2, 

3 (the group S;) and let H,, = C; = {123, 312, 231}, the cyclic group of 
order three. The elements of S, that are not in H, are {132, 213, 321}. 
Choosing g, = 132, we obtain 

¢ He 132,321,213}, 
and therefore 


a = C3 + gC; K = 2. 
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This is the result obtained in the decomposition of the group D, if we 
make the substitutions e = 123, a = 312, a’ = 231, b = 132, c = 321, 
and d= 213. 

The groups D, and S, are said to be isomorphic. Isomorphic 
groups have the same group multiplication table. Isomorphism is a 
special case of homomorphism that involves a many-to-one 
correspondence. 

5.5 Conjugate classes and invariant subgroups 

If there exists an element v € G, such that two elements a, b € 
G, are related by vav' = b, then b is said to be conjugate to a. A finite 
group can be separated into sets that are conjugate to each other. 

The class of G, is defined as the set of conjugates of an 
element a © G,. The element itself belongs to this set. If a is 
conjugate to b, the class conjugate to a and the class conjugate to b are 
the same. If a is not conjugate to b, these classes have no common 
elements. G, can be decomposed into classes because each element of 
G, belongs to a class. 

An element of G, that commutes with all elements of G, forms 
a class by itself. 

The elements of an abelian group are such that 

bab’ = a for all a,b EG,, 
so that 
ba = ab. 
If H,, is a subgroup of G,, we can form the set 


{aea’, ah,a"’, ....ah,a'} =aH,a' where a€G, . 
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Now, aH,,a"' is another subgroup of H,, in G,. Different subgroups 
may be found by choosing different elements a of G,. If, for all values 
of aEGG, 
aH,,a' = H,, 

(all conjugate subgroups of H,, in G, are identical to H,,), 
then H,, is said to be an invariant subgroup in G,,. 

Alternatively, H,, is an invariant in G, if the left- and right- 
cosets formed with any a € G, are equal, i. e. ah; = h,a. 

An invariant subgroup H,, of G, commutes with all elements of 
G,. Furthermore, if h; € H,, then all elements ah,a' € H,, so that H,, is 
an invariant subgroup of G, if it contains elements of G, in complete 
classes. 

Every group G, contains two trivial invariant subgroups, H,, = 
G, and H,, = e. A group with no proper (non-trivial) invariant 
subgroups is said to be simple (atomic). If none of the proper invariant 
subgroups of a group is abelian, the group is said to be semisimple. 

An invariant subgroup H,, and its cosets form a group under 
multiplication called the factor group (written G,/H,,) of H,, in G,. 

These formal aspects of Group Theory can be illustrated by 
considering the following example: 
The group D, = {e, a, a’, b, c, d} ~ S, = {123, 312, 231, 132, 321, 
213}. C,is a subgroup of S;: C; = H; = fe, a, a°} = {123, 312, 231}. 
Now, 

bH, = {132, 321, 213} =H,b 
cH, = {321, 213, 132} = H,c 

and 


dH, = {213,132, 321} = Hd. 
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Since H, is a proper invariant subgroup of S;, we see that S; is not 
simple. H, is abelian therefore S, is not semisimple. 
The decomposition of S; is 
S;, = H, + bH, = H, + H;b. 
and, in this case we have 
H,b = H,c = Hd. 
(Since the index of H, is 2, H; must be invariant). 


The conjugate classes are 


cac! = cac =a’ 


dad‘ = dad = a” 
The class conjugate to a is therefore {a, a°}. 
The class conjugate to b is found to be {b, c, d}. The group S, can be 
decomposed by classes: 
S, = {e} + {a, a°} + {b, c,d}. 
S, contains three conjugate classes. 
If we now consider H,, = {e, b} an abelian subgroup, we find 
aH,, = {a,d}, H,,a = {a.c}, 
vH,, = {a’,c}, H,,a’ = {a°, d}, etc. 
All left and right cosets are not equal: H,, = {e, b} is therefore not an 
invariant subgroup of S;. We can therefore write 
S, = {e, b} + {a, d} + {a’,c} 
= H 


+ aH, + @H,. 


m 
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Applying Lagrange’s theorem to S; gives the orders of the possible 
subgroups: they are 

order 1: {e} 

order 2: {e, d}; {e, c}; {e, d} 

order 3: {e, a, a°} (abelian and invariant) 

order 6: S;. 
5.6 Permutations 

A permutation of the set {1, 2, 3, .....n} of n distinct elements is 
an ordered arrangement of the n elements. If the order is changed then 
the permutation is changed. The number of permutations of n distinct 
elements is n! 

We begin with a familiar example: the permutations of three 
distinct objects labeled 1, 2, 3. There are six possible arrangements; 
they are 

123,312,251 132,521,213; 


These arrangements can be written conveniently in matrix form: 


123 ) 123) 123 
T = Th = 1 = ; 

1230 (312) 231 

123 123 123 
Tt, = is.= 1 = 

132 (321 213) 


The product of two permutations is the result of performing one 


arrangement after another. We then find 


JUN; = Tt 
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and 

It,7, = Tt, 
whereas 

JU,I; = I, 
and 

Ts, = My. 


The permutations 7, 2, %, commute in pairs (they correspond to 
rotations in the dihedral group) whereas the remaining permutations do 
not commute (they correspond to reflections). 

A general product of permutations can be written 


Si S54 2 Se | ie aera | eam | 


tts coat | Biggs ese tbe. t 
The permutations are found to have the following properties: 
1. The product of two permutations of the set {1, 2, 3, ...} is itself a 
permutation of the set. (Closure) 
2. The product obeys associativity: 
(11,55); = 14,(5,7,), (not generally commutative). 
3. An identity permutation exists. 
4. An inverse permutation exists: 


Si Syd a 5S, 


12...n 


such that su! = 2'm = identity permutation. 


The set of permutations therefore forms a group 
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5.7 Cayley’s theorem: 
Every finite group is isomorphic to a certain permutation 
group. 
Let G, ={2), 2, 23, - - -2,+ be a finite group of order n. We choose 
any element g, in G,, and we form the products that belong to G,: 
Si81> Si82» 8i83. - + - Si&n- 
These products are the n-elements of G rearranged. The permutation 


It,, associated with g; is therefore 


£1 So : f En 


Sif: iS. . . SiSn 


If the permutation 7, associated with g;is 


£1 So : : Sn 
= 
Si Si22 . . SiSn 
where g; # g;, then 
£1 &2 7 : En 
L (ggigi (gigdS -  - (En) 


This is the permutation that corresponds to the element g;¢; of G,. 
There is a direct correspondence between the elements of G, and the n- 


permutations {7,, 1, . . .,}. The group of permutations is a 
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subgroup of the full symmetric group of order n! that contains all the 
permutations of the elements g,,g5,. . g,. 

Cayley’s theorem is important in quantum systems in which 
the indistinguishability of the fundamental particles means that certain 
quantities must be invariant under the exchange or permutation of the 
particles. 

6 
LIE’S DIFFERENTIAL EQUATION, INFINITESIMAL 
ROTATIONS AND ANGULAR MOMENTUM OPERATORS 

Although the field of continuous transformation groups (Lie 
groups) has its origin in the theory of differential equations, we shall 
introduce the subject using geometrical ideas. 

6.1 Coordinate and vector rotations 

A 3-vector Vv = [V,, Vy, V,] transforms into v = [v,’, vy’, v,/] 
under a general coordinate rotation R about the origin of an orthogonal 
coordinate system as follows: 

v =R yy, where 
iv ji ki 


R 


ij jj kj 
ko jk kK’ 


cos6;.. 


cos6;,. 

cos8,, . cosO,.- 

in which i, j, k, i’, j°, k° are orthogonal unit vectors, along the axes, 
before and after the transformation, and the cos6,,’s are direction 


cosines. 
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The simplest case involves rotations in the x-y plane: 


— “y 
Vv. | = | cos0; cos0;, Vx 
Vy: cos8;, cosO;, | | Vv 


y 


= | cosd sing | | v, |=R)v 
—sind cos | | v, 

< 
where R,(@) is the coordinate rotation operator. If the vector is rotated 


in a fixed coordinate system, we have ¢ — -9@ so that 


v =R Oy, 
where 
x 
R,(o) = | cosd sing 
sind = cos 


6.2 Lie’s differential equation 

The main features of Lie’s Theory of Continuous 
Transformation Groups can best be introduced by discussing the 
properties of the rotation operator R,(@) when the angle of rotation is 
an infinitesimal. In general, R,(@) transforms a point P[x, y] in the 
plane into a “new” point P’[x’, y’]: P’- = R,(@)P. 
Let the angle rotation be sufficiently small for us to put cos() = 1 and 


sin(d) = 6, in which case, we have 
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R,(5) = ; 1 -d> 
Kee 1 

and 

x= x.l-ydp=x —- ydo 

y =xdot+y.1 =xdo+y 
Let the corresponding changes x — x’ and y ~ y’ be written 

x° =x +6x and y =y +dy 
so that 

dx = -ydo and dy = xd. 
We note that 


= 1+ ido 
where 

i= | 0 -1/= R\(a2). 

1.0 
Lie introduced another important way to interpret the operator 
i = R,(a/2) that involves the derivative of R,(d) evaluated at the 
identity value of the parameter, @ = 0: 
dR,(p)/dp| = | -sind -cosp || = |0 -1) = i 
%-9 | cosd —sing 1 0 


so that 


R,(5o) = 1+ dR,@)/do | . 89, 
o=0 
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a quantity that differs from the identity I by a term that involves the 
infinitesimal, 50: this is an infinitesimal transformation. 
Lie was concerned with Differential Equations and not 


Geometry. He was therefore motivated to discover the key equation 


dR,(o)/do = f | cosd —singd 
UU O sind cos 
= iR,@). 


This is Lie’s differential equation. 


Integrating between > = 0 and $ = 9, we obtain 


R\@) o 
SAR YR) = if do 
I 0 
so that 
In(R,(p)/D) = id, 
or 


R,(o) = Ie®, 
the solution of Lie’s equation. 
Previously, we obtained 

R,(o) = Icoso + ising. 
We have, therefore 

Ie = Icosd + ising . 
This is an independent proof of the famous Cotes-Euler equation. 
We introduce an operator of the form 
O = g(x, y, 0/0x, 0/dy), 

and ask the question: does 


dx = Of(x, y; 60) ? 
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Lie answered the question in the affirmative; he found 
dx = O(xd0) = (xd/dy — yd/dx)xdo = -ydh 
and 
dy = O(ydo) = (xd/dy — yd/dx)ydd = xdq. 
Putting x = x, and y = x,, we obtain 
6x; = Xx,6o ,1=1,2 
where 
X = O = (x,0/0x, — x,0/0x,), 
the “generator of rotations” in the plane. 
6.3 Exponentiation of infinitesimal rotations 
We have seen that 
R,(p) =e" 
and therefore 
R, (5p) = 1 + 189, 
for an infinitesimal rotation, do 
Performing two infinitesimal rotations in succession, we have 
R,°(S) = (1+ 159)” 
= I + 2i8¢ to first order, 
= R,(260). 
Applying R,(6o) n-times gives 
R,"(6p) = R,(nS9) =e” = e* 
= R,(@) (as n > © and 69 = 0, 
the product ndo = 6). 
This result agrees, as it should, with the exact solution of Lie’s 
differential equation. 
A finite rotation can be built up by exponentiation of 


infinitesimal rotations, each one being close to the identity. In general, 
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this approach has the advantage that the infinitesimal form of a 
transformation can often be found in a straightforward way, whereas 
the finite form is often intractable. 
6.4 Infinitesimal rotations and angular momentum operators 
In Classical Mechanics, the angular momentum of a mass m, 
moving in the plane about the origin of a cartesian reference frame 
with a momentum p is 
L, =r x p =rpsinon, 
where n, is a unit vector normal to the plane, and @ is the angle 
between r and p. In component form, we have 
ji Xp, — yp,, where p, and p, are the 
cartesian components of p. 
The transition between Classical and Quantum Mechanics is 
made by replacing 
p, by —-i(h/27)0/ox (a differential operator) 
and p, by —i(h/21)d/dy (a differential operator), 
where h is Planck’s constant. The quantum operator is therefore 
L,° = -i(h/2)(xd/dy — yo/dx) =—i(h/2m)X 
so that 
X = iL,°/(h/2z), 
and 
dx;= Xx, 8 = (20iL,2/h)x; 8h, i = 1,2. 
Let an arbitrary, continuous, differentiable function f(x, y) be 
transformed under the infinitesimal changes 
x° =x- ydo 
y =yt+xdd. 


Using Taylor’s theorem, we can write 
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f(x’, y’) = f(x + 6x, y + dy) 
= f(x — ydo, y + xd9) 
= f(x, y) + (0f/0x)6x + (0f/dy)dy 
= f(x, y) + 6@(-y(0/0x) + x(0/dy))f(x, y) 
=1+ 2midoL,/h)f(x, y) 
= emttiah fix, y) 
= R,(2aL,50/h) f(x, y). 
The invariance of length under rotations follows at once: 
if f(x, y)=x’+y° then 
Of/0x = 2x and Of/dy = 2y, therefore 
f(x’, y’) = f(x, y) + 2x6x + 2ydy 
= f(x, y) — 2x(yd) + 2y(xdq) = f(x, y). 
This is the only form that is length-invariant under rotations. 
6.5 3-dimensional rotations 
Consider three successive counterclockwise rotations about the 


x, y, and z”’ axes through angles u, 0, and 9, respectively: 


u about x 
a 


Zz 
y 
x 
\é 
z! y’ 
8 about y” 
—_> 
x! 


Z 


o about z”* 
a ad 
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The total transformation is R,(u, 8, >) = R.(o)R,(6)R,(u) 
ad 
cosdcos@ cosdsinOsinu + sindcosu —cosdsinOcosu + singdsinu 
= |-singdcos8 -sindsinOsinu + cosdcosu — singsinOcosu + cosdsinu 


sinO —cos@sinu cosOcosu 


For infinitesimal rotations, the total rotation matrix is, to lst-order in 


the 6’s: 


1 do -d0 
R (du, 50,50) = | -do 1 du 
50 -du 1 | 


1 8 0) (1 0-86) (1 0 0 
= |-8610//0 1 0||0 1 ou 
0 0 1/(80 0 1]| 0-1 

oe 


= (14+ Y,8o)(1+ Y,80 )(1+ Y,5p) 


where 


000 0 0-1 (010 
Y,=|0 0 1,,¥,=|000|, ¥,4-100 
0-1 0 100 000 


To l|st-order in the 6’s, we have 


R,(du, 80, 5) =I + Y,Su + Y,80 + Y,59. 


6.6 Algebra of the angular momentum operators 
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The algebraic properties of the Y’s are important. For example, we 


find that their commutators are: 


(0 0 0)/0 0-1 0 0-1\/(0 00) 
[Y,,¥.] =}0 0 1;;/0 0 0; - |0 0 0]/0 O01 
0-1 0)\1 0 0 (1 0 0;\0-1 0 
= -Y,, 
[Y;,Y¥,] = ¥, 
and 
[Y,,Y;] = -Y,. 
These relations define the algebra of the Y’s. In general, we 
have 


[Y, YJ = +Y=e Y., 


where €;,, is the anti-symmetric Levi-Civita symbol. It is equal to +1 if 


jkl is an even permutation, -1 if jkl is an odd permutation, and it is 


equal to zero if two indices are the same. 


Motivated by the relationship between L, and X in 2- 


dimensions, we introduce the operators 
J, =-i2wh)Y, ,k = 1,2, 3. 
Their commutators are obtained from those of the Y’s, for example 
[Y,, Y,]=-Y;— [2miJ,/h, 2miJ,/h] = -2m1J,/h 

or 

-[J,, J,](2a/hy = -2niJ,/h 
and therefore 

[J,. J] = ihJ,/27 . 


These operators obey the general commutation relation 


58 


(Jj, J] = they J)/20. 
The angular momentum operators form a “Lie Algebra”. 
The basic algebraic properties of the angular momentum 
operators in Quantum Mechanics stem directly from this relation. 
Another approach involves the use of the differential operators 
in 3-dimensions. A point P[x, y, z] transforms under an infinitesimal 
rotation of the coordinates as follows 
P[x’,y’,z’] =R.(u, 68, d]PIx, y, z] 
Substituting the infinitesimal form of R, in this equation gives 
dx=x°-x= ydo — 260 
dy =y -y= -xdo + ZOU 
éz=z -z= xd0-ydu. 
Introducing the classical angular momentum operators: L;", we find 
that these small changes can be written 
3 


dx, = > ba, Xx, 


1 


k=1 


For example, if i= 1 
6x, =6x = du(zd/dy - yd/dz)x 
+ 60(—z0/0x + x0/dz)x 
+ d( yd/dx - xd/dy)x 
=-z60 + ydd. 
Extending Lie’s method to three dimensions, the infinitesimal 
form of the rotation operator is readily shown to be 


3 
R.(du, 80, 8) = I + ¥ (GR/Ga,)| « 80, . 


v1 All ai’s =0 
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7 
LIE’S CONTINUOUS TRANSFORMATION GROUPS 

In the previous chapter, we discussed the properties of 
infinitesimal rotations in 2- and 3-dimensions, and we found that 
they are related directly to the angular momentum operators of 
Quantum Mechanics. Important algebraic properties of the matrix 
representations of the operators also were introduced. In this 
chapter, we shall consider the subject in general terms. 

Let x;,i = 1 to n be a set of n variables. They may be 
considered to be the coordinates of a point in an n-dimensional 
vector space, V,. A set of equations involving the x,’s is obtained 
by the transformations 

X;° = £,(X,, Xo,..Xqi ay, a,..a,),1= 1 ton 
in which the set a, a,, ...a, contains r-independent parameters. The 
set T,, of transformations maps x — x’. We shall write 

x° = f(x; a) orx’ = T,x 
for the set of functions. 

It is assumed that the functions f,; are differentiable with 
respect to the x’s and the a’s to any required order. These functions 
necessarily depend on the essential parameters, a. This means that 
no two transformations with different numbers of parameters are the 
same. r is the smallest number required to characterize the 
transformation, completely. 

The set of functions f, forms a finite continuous group if: 
1. The result of two successive transformations x — x” — x” is 
equivalent to a single transformation x — x’: 


x” = f(x*; b) = f(f(x; a); b) 
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f(x; c) 
f(x; x(a; b)) 


where c is the set of parameters 
¢, = % (ab), A= 1 tor, 
and 
2. To every transformation there corresponds a unique inverse that 
belongs to the set: 
4 a such that x = f(x’; a) = f(x’; a) 
We have 
T,T,' = T,'T, = I, the identity. 
We shall see that 1) is a highly restrictive requirement. 

The transformation x = f(x; a) is the identity. Without loss 
of generality, we can take a, = 0. The essential point of Lie’s 
theory of continuous transformation groups is to consider that part 
of the group that is close to the identity, and not to consider the 
group as a whole. Successive infinitesimal changes can be used to 
build up the finite change. 

7.1 One-parameter groups 

Consider the transformation x — x’ under a finite change in 

a single parameter a, and then a change x° + dx’. There are two 


paths from x — x’ + dx’; they are as shown: 


an “infinitesimal” 
‘Ne ; 
da te parameter change, 
x’ + dx’ 


a+da 
a “differential” * 


x(a = 0) 
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We have 
x° + dx’ = f(x; a + da) 
= f(f(x; a); da) = f(x’; da) 
The Ist-order Taylor expansion is 
dx’ = Of(x*; a)/da| da = u(x’) da 
a=0 
The Lie group conditions then demand 


a+ da = (a; da). 


But 
x(a; 0) =a, (b =0) 
therefore 
at+da=a+t dx(a; by/ab 5a 
b=0 
so that 
da = dx(a; b)/db | Sa 
b=0 
or 
da = A(a)da. 
Therefore 
dx* = u(x’)A(a)da, 
leading to 
dx“/u(x*) = A(a)da 
so that 
Sf dx’/u(x’) =fA(a)da = s 
x 0 


(s = 0 = the identity). 
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We therefore obtain 
U(x’) - U(x) =s. 
A transformation of coordinates (new variables) therefore transfers 
all elements of the group by the same transformation: a one- 
parameter group is equivalent to a group of translations. 
Two continuous transformation groups are said to be similar 
when they can be obtained from one another by a change of 


variable. For example, consider the group defined by 


x a 0 | “| 
x, | =| 0 a’ | Xp, 
The identity coprresponds to a = 1. The infinitesimal 


transformation is therefore 
x, (1 + da) 0 ) x, 
re 0 el +a Kisii 
J 


To Ist-order in 6a we have 


X, = x, +x,6a 


and 

X, = X, + 2x,6a 
or 

6x, = x,6a 
and 

Ox, = 2x,da. 


In the limit, these equations give 


dx,/x, = dx,/2x, = da. 
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These are the differential equations that correspond to the 
infinitesimal equations above. 


Integrating, we have 


x’ a oa ‘ 
J-dxisy = fdaand: f.dx/2x = da. 
xl 0 x2 0 
so that 
Inx,” -— Inx, = a = In(x,7/x,) 
and 
In(x,7/x,) = 2a = 2In(x,7/x,) 
or 
UW? Se SUS Geers 


Putting V = Inx,, we obtain 
V’ = V +a andU’= U, the translation group. 
7.2 Determination of the finite equations from the infinitesimal 
forms 
Let the finite equations of a one-parameter group G,,) be 

Ki = OOK Ky a) 
and 

Ky” = W(X, %. 5a), 
and let the identity correspond to a =0. 
We consider the transformation of f(x,, x,) to f(x,*, x.°). We 
expand f(x,’, x,_) in a Maclaurin series in the parameter a (at 
definite values of x, and x,): 

f(x,“,x.°) = f(0) + f(O)a + f(O)a7/2! + ... 

where 


£0) = £017, Xo) aco = F(K), Xp), 
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and 
f°(0) = (df(x,’, x,”)/dal ,_5 
={(0f/0x,°)(dx,“/da)+ (0f/0x,°)(dx,“/da)}1 ,-o 
={(0f/0x,“)u(X,, X.°) + (Of/0X,7)V(X,", X27) Flaco 
therefore 


f°(O) = {(u(d/dx,) + v(0/0x,))fHI 5 

= Xf(x,, X)). 

Continuing in this way, we have 
f (Oy aid ox, ida tly = XiGg, a5). €tesz, 
The function f(x,’, x“) can be expanded in the series 
f(x,°, X.°) = f(0) + af((O) + (a°/2!)f°(0O) +... 

= f(x,,X,) + aXf + (a7/2!)X’f + ... 
X"f is the symbol for operating n-times in succession of f with X. 
The finite equations of the group are therefore 

xX) = X, + aXx, +(a’°/2!)X’x, + ... 
and 
X) = X) + aXx, +(a°/2!)X’x, + ... 

If x, and x, are definite values to which x,’and x, reduce for the 
identity a=0, then these equations are the series solutions of the 
differential equations 

dx,“/u(x,", X,°) = dx,7/v(x,°, X.°) = da. 
The group is referred to as the group Xf. 
For example, let 

Xf = (x,0/0x, + Xx,0/0x,)f 
then 
X; =X, + aXx, + (a’/2!)X°F ... 


=X, + a(x,0/0x, + x,0/0x,)x, + ... 
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=X, +ax, + (a7/2!)(x,0/dx, 
+ x,0/0X,)x,; + 
=x, + ax, + (a7/2!)x, + ... 


=x 1 + a+ a7/2! + ...) 


= ee: 
Also, we find 
x) = ee 
Putting b = e*, we have 
xX, = bx, and x,” =bx,. 


The finite group is the group of magnifications. 
If X = (xd/dy —- yo/dx) we find, for example, that the finite group is 
the group of 2-dimensional rotations. 
7.3 Invariant functions of a group 

Let 

Xf = (ud/dx, + vod/0x,)f 
define a one-parameter group, and let a = 0 give the identity. A 
function F(x,, x,) is termed an invariant under the transformation 
group G_,) if 
F(x,", X5°) = F(x,, xX») 

for all values of the parameter, a. 

The function F(x,”, x,”) can be expanded as a series in a: 


F(x,’, X5”) = F(x,, x») + aXF + (a/2!)X(XF) + ... 


If 
F(x,’, X.) = F(X, xX.) 
= invariant for all values of a, it is 
necessary for 
XF = 0. 


This means that 
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{u(x,, X,)0/0x, + V(X, X,)0/0x,}F = 0. 
Consequently, 
F(x,, X,) = constant 
is a solution of 


dx,/u(x,, X>) dx, /V(Xj5 Xs) 


This equation has one solution that depends on one arbitrary 
constant, and therefore G,,, has only one basic invariant, and all 
other possible invariants can be given in terms of the basic 
invariant. 

For example, we now reconsider the invariants of rotations: 

The infinitesimal transformations are given by 

Xf = (x,0/0x, — x,0/0x,), 
and the differential equation that gives the invariant function F of 
the group is obtained by solving the characteristic differential 
equations 
dx,/x, = do, and dx,/x, = —-d@, 
so that 
dx,/x, + dx,/x, = 0. 
The solution of this equation is 
x, + x,” =constant, 
and therefore the invariant function is 
F(x,,X)) = X) + X,’. 

All functions of x,7 + x,’ are therefore invariants of the 2- 
dimensional rotation group. 

This method can be generalized. A group G,, in n-variables 
defined by the equation 


Xi = o(x,, Xo, X35 oe Xys a),i = 1 to Nn, 
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is equivalent to a unique infinitesimal transformation 
ME = (yy Kos Kay se %,JOT/OX,. + oc y(Xyy Ky Kgs «.-K,OT/Ox, 
If a is the group parameter then the infinitesimal transformation is 
X; = X; + U(X), X),...x,)da Gi = 1 ton), 
then, if E(x,, X,, ...x,) 1s a function that can be differentiated n-times 
with respect to its arguments, we have 
E(X,*, Xo", .X,°) = E(x), X,...X,) + aXE + (a7/2!)X’E +. 
Let (X,, X», ..-X,) be the coordinates of a point in n-space and let a be 
a parameter, independent of the x,’s. As a varies, the point (X,, X5, 
...X,) Will describe a trajectory, starting from the initial point (x,, xX), 
...X,). A necessary and sufficient condition that F(x,, x;, ...x,) be an 
invariant function is that XF = 0. A curve F = 0 is a trajectory and 
therefore an invariant curve if 
EK, Xo Keo = 0. 
8 
PROPERTIES OF n-VARIABLE, r-PARAMETER LIE 
GROUPS 
The change of an n-variable function F(x) produced by the 
infinitesimal transformations associated with r-essential parameters is: 


dF => (dF/0x,)dx, 


i=l 
where 


dx, = ¥ u,,(x)da, , the Lie form. 


r=1 


The parameters are independent of the x,’s therefore we can write 
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dF = ¥ $a,{¥ u,(x)(0/0x,)F} 


rA=1 i=l 


where the infinitesimal generators of the group are 


n 


X, = > u,(x)(0/0x,) , A= 1 tor. 


i=l 


The operator 


ig 


I+ ¥ X,6a, 
hel 
differs infinitesimally from the identity. 
The generators X, have algebraic properties of basic importance in the 
Theory of Lie Groups. The X,’s are differential operators. The 
problem is therefore one of obtaining the algebraic structure of 
differential operators. This problem has its origin in the work of 
Poisson (1807); he introduced the following ideas: 
The two expressions 

X,f = (u,,0/0x, + U,,0/0x,)f 
and 

Xf = (U,,0/0x, + U,,0/0x,)f 
where the coefficients u;, are functions of the variables x,, x,, and f(x,, 
X,) 1s an arbitrary differentiable function of the two variables, are 
termed linear differential operators. 


The “product” in the order X, followed by X, is defined as 
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X,X,f = (u,,0/0x, + u,,0/0x,)(u,,0f/0x, + U,,0f/0x,) 
The product in the reverse order is defined as 
X,X,f = (u,,0/0x, + u,,0/0x,)(u,,0f/0x, + U,,0f/0x,). 
The difference is 
X,X,f - X,X,f = X,u,,0f/ox, + X,u,,0f/0x, 
— X,u,,0f/dx, — X,u,,0f/0x,. 
= (X,u,, -— X.u,,)df/ox, 
+ (X,u,, — X,u,,)df/dx, 
= [X,, X, Jf. 
This quantity is called the Poisson operator or the commutator of the 
operators X,f and X,f. 
The method can be generalized to include ’ = 1 to r essential 
parameters and i = | ton variables. The ath-linear operator is then 
X, = u,,0f/0x, 
= ¥ u,,0f/0x, 
i=l 
(Sum over repeated indices). 
Lie’s differential equations have the form 
0x,/da, = U,(X)A,(a) ,i= 1 ton,A=1 tor. 
Lie showed that 
(0C,,,/0a,)U;, = O 
in which 
U,,0U;,/OX; — U,,0U;,/OX; = Cyq(a)Uy(X), 
so that the c,,,,’s are constants. Furthermore, the commutators can be 
written 


[X,, X,] = ( CypoU,)0/0X; 
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= CkpoXx- 
The commutators are linear combinations of the X,’s. (Recall the 
earlier discussion of the angular momentum operators and _ their 
commutators). 
The c,,,’8 are called the structure constants of the group. They have 
the properties 
Cyog = Crop > 
Cinaeonac Ciedoine oe Suepeuue => Os 
Lie made the remarkable discovery that, given these structure 
constants, the functions that satisfy 
Ox,/da, = UA, (a) can be found. 

(Proofs of all the above important statements, together with proofs of 
Lie’s three fundamental theorems, are given in Eisenhart’s standard 
work Continuous Groups of Transformations, Dover Publications, 
1961). 
8.1 The rank of a group 

Let A be an operator that is a linear combination of the 
generators 
of a group, X;: 

A = aX, (sum over i), 
and let 
X = xX,. 
The rank of the group is defined as the minimum number of 
commuting, linearly independent operators of the form A. 
We therefore require all solutions of 
[A, X] =0. 


For example, consider the orthogonal group, O*(3); here 
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A = 0X, 1=1 to3, 
and 
X = xX; j=1to3 
so that 
[A, X] =a,x,[X;, X;] i,j =1 to3 
= OXF; My . 
The elements of the sets of generators are linearly independent, 
therefore 
OX; = 0 (sum over i, j,,k = 1, 2, 3) 


This equation represents the equation 


0 
0 


0 -O; A) \ 3 


The determinant of © is zero, therefore a non-trivial solution of the x,’s 
exists. The solution is given by 
x = a G=1,2,3) 
so that 
A=X. 
O*(3) is a group of rank one. 
8.2 The Casimir operator of O*(3) 
The generators of the rotation group O*(3) are the operators. Y,,’s, 
Discussed previously. They are related to the operators, J,: 
J. = -i(h/2m)Y, (k = 1, 2, 3). 


The matrix representations of the Y,,’s are 


2 


00 0 0 0-1) 010 
Y,= |001), Y, =/00 0|, Y,= |-1 0 Ol. 
0 -1 0 10.0 00 0 


The square of the total angular momentum, J is 


3 
yoy s; 
1 
= (h/2n)’ (Y,° + Y,’ + Y;’) 
= (h/2n)*(-21). 
Schur’s lemma states that an operator that is a constant multiple of I 
commutes with all matrix irreps of a group, so that 
JeJ)] = 0 .k=1,2 3. 
The operator J’ with this property is called the Casimir operator of the 
group O*(3). 

In general, the set of operators {C,} in which the elements 
commute with the elements of the set of irreps of a given group, forms 
the set of Casimir operators of the group. All Casimir operators are 
constant multiples of the unit matrix: 

C; = aj; the constants a; are 
characteristic of a particular representation of a group. 
9 
MATRIX REPRESENTATIONS OF GROUPS 

Matrix representations of linear operators are important in 
Linear Algebra; we shall see that they are equally important in 
Group Theory. 


If a group of m x m matrices 
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D,” = {D,"(g,),--De'(g,), Da '(g,)} 
can be found in which each element is associated with the 
corresponding element g, of a group of order n 

G, = {215---Bes-- Bats 
and the matrices obey 
D(g)D/(g) = Digg), 

and 

D,“"(g,) = I, the identity, 
then the matrices D,‘"(g,) are said to form an m-dimensional 
representation of G,. If the association is one-to-one we have an 
isomorphism and the representation is said to be faithful. 

The subject of Group Representations forms a very large 
branch of Group Theory. There are many standard works on this 
topic (see the bibliography), each one containing numerous 
definitions, lemmas and theorems. Here, a rather brief account is 
given of some of the more important results. The reader should 
delve into the deeper aspects of the subject as the need arises. The 
subject will be introduced by considering representations of the 
rotation groups, and their corresponding cyclic groups. 

9.1 The 3-dimensional representation of rotations in the plane 

The rotation of a vector through an angle @ in the plane is 


characterized by the 2 x 2 matrix 


cosd —sing 
R\(o) = 


sind —cos@ 
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The group of symmetry transformations that leaves an 
equilateral triangle invariant under rotations in the plane is of order 
three, and each element of the group is of dimension two 


G,~R,” ={R(0), R(20/3), R(40/3)} 
a a 
={i1 ses 0), -1/2 v3/2 |}. 
O 1) \v3/2 -1/2) (-v3/2 -1/2 
~ {123,312,231} = G,. 


These matrices form a 2-dimensional representation of C; . 

A 3-dimensional representation of C,; can be obtained as 
follows: 

Consider an equilateral triangle located in the plane and let 
the coordinates of the three vertices P,[x, y], P,[x’, y’], and P,[x”’, 
y°] be written as a 3-vector P,, = [P,, P,, P;], in normal order. 
We introduce 3 x 3 matrix operators D,;” that change the order of 
the elements of P,,, cyclically. The identity is 

P,, = D,P,;, where D,° = diag(1, 1, 1). 

The rearrangement 


P,,; > P,,[P3, P,, P,] is given by 


P,; = D,°P,;, 
where 
001 
jb cae Ue ee he 6 on 
(0 1 0 


and the rearrangement 
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P,, — P;,[P,, P;, P,] is given by 
P,, = D,P,,; where 
Oe 10 


D® = | 001 
100 


The set of matrices {D,°} = {D,°, D,, D,} is said to form a 3- 


dimensional representation of the original 2-dimensional 


representation 


{R,°}. The elements D;” have the same group 


multiplication table as that associated with C;. 


9.2 The m-dimensional representation of symmetry 


transformations in d-dimensions 


Consider the case in which a group of order n 


is represented by 


where 


Ga = Bis Bis i Sexinkis 


R,M= {R,, KR, ... RL 


R ae ‘diy G,, 


and R ,‘” is an m x m matrix representation of g,. Let P,, be a 


vector in d-dimensional space, written in normal order: 


and let 


Pa = [P,,P,, Pal, 


Po. = LP as Pos Pe! a | 
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be an m-vector, written in normal order, in which the components 


are each d-vectors. Introduce the m x m matrix operator D,“"(g,) 


such that 
Pin = D,“(g,) Pa 
Po. = D,“(g,) Pik 
Pin = De") Pas k= 1 tom, 


the number of symmetry operations. 


P ,,, is the kth (cyclic) permutation of P and D,‘"(g,) is called 


im > 
the “m-dimensional representation of g,”’. 
Infinitely many representations of a given representation can 

be found, for, if S is a matrix representation, and M is any definite 
matrix with an inverse, we can form T(x) = MS(x)M',V x €G. 
Since 

T(xy) = MS(xy)M" = MS(x)S(y)M" = MS(x)M'™MS(y)M’ 
T(xX)T(y), 


T is a representation of G. The new representation simply involves 


a change of variable in the corresponding — substitutions. 
Representations related in the manner of S and T are equivalent, 
and are not regarded as _ different representations. All 
representations that are equivalent to S are equivalent to each other, 
and they form an infinite class. Two equivalent representations will 
be written S ~ T. 
9.3 Direct sums 

If S is a representation of dimension s, and T is a 


representation of dimension t of a group G, the matrix 


Ty 


S(g) 0 
P= , (g EG) 
0 Tg) 


of dimension s + t is called the direct sum of the matrices S(g) and 
T(g), written P=S @ T. Therefore, given two representations (they 
can be the same), we can obtain a third by adding them directly. 
Alternatively, let P be a representation of dimension s + t; we 


suppose that, for all x € G, the matrix P(x) is of the form 
es 
A(x) 0 
0 B(x) 


where A(x) and B(x) are s x s and t x t matrices, respectively. (The 
0’s are s x t and t x s zero matrices). Define the matrices S and T 
as follows: S(x) = A(x) and T(x) = B(x), Vx EG. 

Since, by the group property, P(xy) = P(x)P(y), 


A(xy) 0 (A(x) 0 \{A(y) 0 
(0 BGy)) 0 BG) | 0—BG) 
A(X)A(y) 0 


0 B(x) By) 
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Therefore, S(xy) = S(x)S(y) and T(xy) = T(x)T(y), so that S and 
T are representations. The representation P is said to be 
decomposable, with components S and T. A representation is 
indecomposable if it cannot be decomposed. 

If a component of a decomposable representation is itself 
decomposable, we can continue in this manner to decompose any 
representation into a finite number of indecomposable components. 
(It should be noted that the property of indecomposablity depends 
on the field of the representation; the real field must sometimes be 
extended to the complex field to check for indecomposability). 

A weaker form of decomposability arises when we consider 


a matrix of the form 


A(x) 0 
P(x) = 
E(x) B(x) 


where A(x), and B(x) are matrices of dimensions s x s and t x t 
respectively and E(x) is a matrix that depends on x, and 0 is the s x 
t zero matrix. The matrix P, and any equivalent form, is said to be 
reducible. An irreducible representation is one that cannot be 
reduced. Every decomposable matrix is reducible (E(x) = 0), 
whereas a reducible representation need not be decomposable. 

If S and T are reducible, we can continue in this way to 
obtain a set of irreducible components. The components are 


determined uniquely, up to an equivalence. The set of distinct 


1 


irreducible representations of a finite group is (in a given field) an 
invariant of the group. The components form the building blocks of 
a representation of a group. 

In Physics, decomposable representations are generally 
referred to as reducible representations (reps). 
9.4 Similarity and unitary transformations and matrix 
diagonalization 

Before discussing the question of the possibility of reducing 
the dimension of a given representation, it will be useful to consider 
some important results in the Theory of Matrices. The proofs of 
these statements are given in the standard works on Matrix Theory. 
(See bibliography). 

If there exists a matrix Q such that 

Q'AQ =B 

then the matrices A and B are related by a similarity 
transformation. 

If Q is unitary (QQ*’ = I: Q* = (Q*)', the hermitian 
conjugate) then A and B are related by a unitary transformation. 

If AX = Q'AQ; B’ = Q'BQ; C’ = Q'CQ... then any 
algebraic relation among A, B, C ...is also satisfied by A’, B’, C’... 

If a similarity transformation produces a diagonal matrix 
then the process is called diagonalization. 

If A and B can be diagonalized by the same matrix then A 
and B commute. 

If V is formed from the eigenvectors of A then the similarity 
transformation V'AV will produce a diagonal matrix whose 


elements are the eigenvalues of A. 
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If A is hermitian then V will be unitary and therefore an 
hermitian matrix can always be diagonalized by a_ unitary 
transformation. A real symmetric matrix can always be 
diagonalized by an orthogonal transformation. 

9.5 The Schur-Auerbach theorem 

This theorem states 

Every matrix representation of a finite group is equivalent to 
a unitary matrix representation 

Let G, = {D,, D,, ....D,} be a matrix group, and let D be the 
matrix formed by taking the sum of pairs of elements 

D = > DD; 

i=1 

where D;" is the hermitian conjugate of D,. 
Since D; is non-singular, each term in the sum is positive definite. 
Therefore D itself is positive definite. Let Ly be a diagonal matrix 
that is equivalent to D, and let L,'” be the positive definite matrix 
formed by replacing the elements of L, by their positive square 
roots. Let U be a unitary matrix with the property that 

L, = UDU". 


Introduce the matrix 

S = Ly '?U, 
then SD,S" is unitary. (This property can be demonstrated by 
considering (SD,S"')(SD,S")', and showing that it is equal to the 
identity). S will transform the original matrix representation G, into 
diagonal form. Every unitary matrix is diagonalizable, and 
therefore every matrix in every finite matrix representation can be 


diagonalized. 
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9.6 Schur’s lemmas 

A matrix representation is reducible if every element of the 
representation can be put in block-diagonal form by a single 
similarity transformation. Invoking the result of the previous 
section, we need only discuss unitary representations. 

If G, = {D™(R)} is an irreducible representation of 
dimension v of a group G,, and {D(R)} is an irreducible 
representation of dimension u of the same group, G,, and if there 
exists a matrix A such that 

D™(R)A = AD" (R) VREG, 
then either 

iA=0 
or 

ii) A is a square non-singular matrix (so that v = w) 

Let the uw columns of A be written ¢,, ¢,, ... ¢, then, for any 
matrices D™ and D“’, we have 

DVA = (D™e,, D’é5..4.DE,) 
and r _ i 


AD” = (> Dc, y Dc, DY G). 


k= 1 k=1 k=1 


therefore . 


Dc, => Dc, : 
k=1 
The u c-vectors therefore span a space that is invariant under the 
irreducible set of v-dimensional matrices {D™}. The e-vectors are 


therefore the null-vector or they span a v-dimensional vector space. 
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The first case corresponds to A = 0, and the second to u=v and A # 
0. 

In the second case, the hermitian conjugates D®,*, ...D™,/ 
and D,*, ...D™,' also are irreducible. Furthermore, since 
D™(R)A = AD (R) 

D”*At = AtD™.*, 
and, therefore, following the method above we find that v => u. We 
must therefore have v = u, so that A is square. Since the v-columns 
of A span a v-dimensional space, the matrix A is necessarily non- 


singular. As a corollary, a matrix D that commutes with an 


irreducible set of matrices must be a scalar matrix. 


9.7 Characters 

If D™(R) and D™(R) are related by a_ similarity 
transformation then D®(R) gives a representation of G that is 
equivalent to D(R). These two sets of matrices are generally 
different, whereas their structure is the same. We wish, therefore, 
to answer the question: what intrinsic properties of the matrix 
representations are invariant under coordinate transformations? 


Consider 


ba [CD(R)C"];, = 2 C,,D,,(R)C,;" 


i ikl 


= 2 §,,D,(R) 


kl 


= ¥ D,,(R), the trace of D(R). 


k 


We see that the trace, or character, is an invariant under a change of 
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coordinate axes. We write the character as 
x(R) = X DCR) 
Equivalent representations have the same set of characters. 
The character of R in the representation u is written 
x(R) or [u; RI. 
The conjugate elements of G have the form S = URU’ so that 
D(R) = D(U)D(R)[D(R)I" 
therefore 
4(S) = x(R). 
We can describe G by giving its characters in a_ particular 
representation; all elements in a class have the same x. 
10 
SOME LIE GROUPS OF TRANSFORMATIONS 
We shall consider those Lie groups that can be described by 
a finite set of continuously varying essential parameters a),...a,: 
Ke SE Kise Re ie) Ta) 
A set of parameters a exists that is associated with the inverse 
transformations: 
peewee) F 
These equations must be solvable to give the x,’s in terms of the 
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X; +S. 


10.1 Linear groups 
The general linear group GL(n) in n-dimensions is given by 


the set of equations 


n 


Xx, = LY a,x, i=1 ton 


1 


j=l 
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in which det la,| # 0. 

The group contains n° parameters that have values covering 
an infinite range. The group GL(n) is said to be not closed. 

All linear groups with n > 1 are non-abelian. The group 
GL(n) is isomorphic to the group of n x n matrices; the law of 
composition is therefore matrix multiplication. 

The special linear group of transformations SL(n) in n- 
dimensions is obtained from GL(n) by imposing the condition 
detl a; |= 1. A functional relation therefore exists among the n= 
parameters so that the number of required parameters is reduced to 
fn Sf), 

10.2 Orthogonal groups 

If the transformations of the general linear group GL(n) are 

such that . 

> x,’ = invariant , 

i=1 
then the restricted group is called the orthogonal group, O(n), in n- 
dimensions. There are [n + n(n - 1)/2] conditions imposed on the n’ 
parameters of GL(n), and therefore there are n(n - 1)/2 essential 
parameters of O(n). For example, in three dimensions 

x’ = Ox; O= {0,,;: OO' =I, detO = 1, a, © R} 


where 


Ai, Az 443 
O = | ay ay ays 


43; 83. 433 
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We have 
XO +X) + X37 = X,° +X,’ +X,” > invariant under O(3). 

This invariance imposes six conditions on the original nine 
parameters, and therefore O(3) is a three-parameter group. 
10.3 Unitary groups 

If the x;’s and the a;;’s of the general linear group GL(n) are 
complex, and the transformations are required to leave xx’ invariant 
in the complex space, then we obtain the unitary group U(n) in n- 


dimensions: 


Um) = {U,,,: UU' =I, detU #0, u, 


pS Ch. 
There are 2n* independent real parameters (the real and imaginary 
parts of the a;’s), and the unitary condition imposes n + n(n-1) 
conditions on them so the group has n’ real parameters. The unitary 
condition means that 
Sela 1, 
and therefore 
lal? < 1 for all i,j. 

The parameters are limited to a finite range of values, and therefore 
the group U(n) is said to be closed. 
10.4 Special unitary groups 

If we impose the restriction detU = +1 on the unitary group 
U(n), we obtain the special unitary group SU(n) in n-dimensions: 

SU(n) = {U,,,: UU' =I, detU = +1, u,, € C}. 

The determinantal condition reduces the number of required real 


parameters to (n? - 1). SU(2) and SU(3) are important in Modern 
Physics. 
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10.5 The group SU(2), the infinitesimal form of SU(2), and the 
Pauli spin matrices 
The special unitary group in 2-dimensions, SU(2), is defined 
as 


SU(2) = {U},.: UU' =I, detU = +1, u, 


i © CH. 
It is a three-parameter group. 
The defining conditions can be used to obtain the matrix 


representation in its simplest form; let 


a b 
U= 
ea 
where a,b,c,d EC. 
The hermitian conjugate is 
Ca* ¢* 
Ue 
b* d* 
and therefore 
lal? + Ib? ac* + bd* 
UU' = 


a*c+b*d Ic? + Id?’ 
The unitary condition gives 


lal? + Ib? = Ic? + 1d? = 1 


and the determinantal condition gives 
ad — be = l. 
Solving these equations, we obtain 
c =—b* and d=a*. 


The general form of SU(2) is therefore 
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U= 
We now study the infinitesimal form of SU(2); it must have 


the structure 


1 0) {da db) [1+da db ) 
0 1 -db* da* \-6b* 1+ 6a* 
The determinantal condition therefore gives 


detU,,, = (1 + da)(1 + 6a*) +5b5b* = 1. 


To first order in the 5’s, we obtain 


1+6a*+6a = 1, 
or 
da = -da*. 
so that 
(1+8a 8b 
| OF 


—-6b* 1-6a 


The matrix elements can be written in their complex forms: 
da = 160/2 , db = 58/2 + idy/2. 


(The factor of two has been introduced for later convenience). 
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1+i80/2  8P/2 + idy/2 


Uns = 
-5B/2 + idy/2 1 - iSa/2 


Any 2x2 matrix can be written as a linear combination of the 


matrices 
F204 40 0 Oe 20), [te 0 
0 btu oOo i O 0 -1) 
as follows 
ab 1 O 0 1 O -i 1 O 
= A +B + G +D 
Lc d 0 1 1 O ul 0 0 -1 
where 


a=A+D,b=B-iC,c=B+iC,andd=A-—D. 


We then have 


if 
a b 1 0 01 
= (a+d)/2 soon! | + i(b—c)/2 


0 1 1 0) 


~ 


0 -i 
+(a—d)/2 


i 0 


1 0 


0-1 


c d 
oA 


The infinitesimal form of SU(2) can therefore be written 
I + (idy/2)o, + GdB/2)o, + (ida/2)o, 


Vine 


or 
I + (/2)X 8t, 6,,j = 1 to 3. 


Uine 


This is the Lie form. 
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The o;’s are the Pauli spin-matrices; they are the generators of the 


group SU(2): 


0 1 0-44 4 0 
0, = »,O,= »03= 
1 0 li 0 0 -1 


They play a fundamental role in the description of spin-1/2 particles 
in Quantum Mechanics. (See later discussions). 
10.6 Commutators of the spin matrices and structure constants 
We have previously introduced the commutators of the 
infinitesimal generators of a Lie group in connection with their Lie 
Algebra. In this section, we consider the commutators of the 
generators of SU(2); they are found to have the symmetric forms 
[0,,0,] = 2103, [6,,0,] =-2i0;, 
[0,,03] = —210,,[03,0,] = 2i0,, 
[0,,03] = 2i0,, [63,0] = —-2i0,. 
The commutator of any pair of the three matrices gives a 
constant, multiplied by the value of the remaining matrix, thus 
[0;, O;] = €j/210,. 
where the quantity €;,, = +1, depending on the permutations of the 
indices. (Egy, = +1,€y., = —1... etc...). 
The quantities 2ie,, are the structure constants associated with the 
group. Other properties of the spin matrices are found to be 
0° = 0, = 6; = 10,0, =i0;,0,0; = i0,,0;0, = io, 
10.7 Homomorphism of SU(2) and O*(3) 
We can form the matrix 


P = x'o = Xj0;, 


j=1,2,3 
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from the matrices 


x = [x,,X>,xX;,] ando = [0,,0,,0;] 


therefore 
X; X, — 1X, 
P = 
X,; + 1X, —X, 
We see that 
pe X,; — 1X, 
Pi = (P*)' = =P 
x1 + 1X -X;_ | 
so that P is hermitian. 
Furthermore, 
TrP = 0, 
and 
detP = -(x2 +x,’ + x,”). 


Another matrix P’ can be formed by carrying out a similarity 
transformation, thus 
P’ = UPU'",(U € SU(2)). 

A similarity transformation leaves both the trace and _ the 
determinant unchanged, therefore 

TrP = TrP’ 
and 

detP = detP’. 


This condition means that 
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or 
XP HR eR? SH we, OS RAE. 
The transformation P’ = UPU' is therefore equivalent to a three- 
dimensional orthogonal transformation that leaves xx’ invariant. 
10.8 Irreducible representations of SU(2) 
We have seen that the basic form of the 2x2 matrix 


representation of the group SU(2) is 


U= ,a,bEC; lal’ + Ib’ =1. 


Let the basis vectors of this space be 


1 0 
x)= and x, = 
0 1 
We then have 
a \ 
x ccs Ss = ax, — b*x,, 
—b* 
and 
~~ 
x, = Ux,= = bx, + a*x,, 
a* 
and therefore 
x = Ux, 


If we write a 2-dimensional vector in this complex space as 
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c = [u, v] then the components transform under SU(2) as 
ue = au + bv 
and 
vo = b*u + a*v 
therefore 
c = Uc. 

The components of the vector ec transform differently from 
those of the basis vector x — the transformation matrices are the 
transposes of each other. The vector c = [u, v] in this complex 
space is called a spinor (Cartan, 1913). 

To find an irreducible representation of SU(2) in a 3- 
dimensional space we need a set of three linearly independent basis 
functions. Following Wigner (see bibliography), we can choose the 
polynomials 

uw’, uv, and v” 


and introduce the polynomials defined by 


1+m l-m 
j=l u Vv 
f = 
: v(1+m)!(1+m)! 
where 
j =n/2 (the dimension of the space is n + 1). 
and 


m=j,j-1l,...-J. 
In the present case,n=2,j)=1,andm=0, +1. 
(The factor Wv{a + m)! (1 - m)!} is chosen to make the 
representative matrix unitary). 
We have, therefore 


f;' =w/V2 , fy)! = uv and f,,! = v’/v2. 
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A 3x3 representation of an element U € SU(2) in this space can be 
found by defining the transformation 
Uf,,'(u, v) = f,,'(u’, v’). 
We then obtain 
Uf,,'(u, v) = (au + bv)'*™(—b*u + a*v)'"™ ,m=0, +1, 


v(1 + m)!(1 - m)! 


so that 
Uf, '(u, v) = (au + bv)*/v 2 
= (au? + 2abuv + b’v?)/V 2, 
Uf,'(u, v) = (au + bv)(—b*u + a*v) 
= —ab*u’ + (lal? - Ibl’)uv + a*bv’, 
and 


Uf.,'(u, v) = (-b*u + a*vy/vV 2 
= (b**u? — 2a*b*uv + a**v?)/v2. 


We then have 


( a v2ab vb f, (#,1- ) 
-V2ab* lal? - Ib? V2a*b | | f)' | = | fy!” 
be? -V2a*b* a*? J [f,! fer 


or 
UF =F’. 
We find that UU * =I therefore U is unitary. 
This procedure can be generalized to an (n + 1)-dimensional space 
as follows: let 
f./(u, v) = wrmym »m=j,j-1,...-j. 


v(j + m)!(j - m)! 
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(Note that j = n/2 = 1/2, 1/1, 3/2, 2/1,...). 
For a given value of j, there are 2j + 1 linearly independent 
polynomials, and therefore we can form a (2j + 1) x (2j + 1) 
representative matrix of an element U of SU(2): 
Uf,(u, v) = f,/(u’, v’). 

The details of this general case are given in Wigner’s classic text. 
He demonstrates the irreducibility of the (2j + 1)-dimensional 
representation by showing that any matrix M which commutes with 
U' for all a, b such that lal’ + Ibl? = 1 must necessarily be a constant 
matrix and therefore, by Schur’s lemma, U' is an irreducible 
representation. 
10.9 Representations of rotations and the concept of tensors 

We have discussed 2- and 3-dimensional representations of 
the orthogonal group O(3) and their connection to angular 
momentum operators. Higher-dimensional representations of the 
orthogonal group can be obtained by considering a 2-index quantity 
T;; — a tensor — that consists of a set of 9 elements that transform 
under a rotation of the coordinates as follows: 

T;, > Ti = Ri-Rj.T,,. (sum over repeated indices 1, 2, 3). 
If T;, = T; (1, is symmetric), then this symmetry is an invariant 


under rotations; we have 


Tj = R,RiT im = R,.RiT nv R,R,T in Tj; - 
If TrT;; = 0 then so is TrT;;’: 
T° = R,)RiTin = (RER) pT pm = Son T om = T,, = 0 
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The set of components of a symmetric traceless 2-index 
tensor contains 5 members so that the transformation T;; > T;;° = 
R;,R,,,T,,, defines a new representation of them of dimension 5. 

Any tensor T;; can be written 

T; = au a T,)2 + (T;; - T)/2; 

the sum of a symmetric and anti-symmetric part, 

= (Ty - (&jTi/3) + OT! - 
The decomposition of the tensor T; gives any 2-index tensor in 
terms of a sum of a single component, proportional to the identity, a 
set of 3 independent quantities combined in an anti-symmetric 
tensor (Tt - T,)/2, and a set of 5 independent components of a 
symmetric traceless tensor. We write the dimensional equation 

9=10305. 

This is as far as it is possible to go in the process of decomposition: 
no other subsets of 2-index tensors can be found that preserve their 
identities under the defining transformation of the coordinates. 
Representations with no subsets of tensors that preserve their 
identities under the defining rotations of tensors are irreducible 
representations. 

We shall see that the decomposition of tensor products into 
symmetric and anti-symmetric parts is important in the Quark 
Model of elementary particles. 

The representations of the orthogonal group O(3) are found 
to be important in defining the intrinsic spin of a particle. The 
dynamics of a particle of finite mass can always be descibed in its 
rest frame (all inertial frames are equivalent!), and therefore the 


particle can be characterized by rotations. All known particles have 
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dynamical states that can be described in terms of the tensors of 
some irreducible representation of O(3). If the dimension of the 
irrep is (2j + 1) then the particle spin is found to be proportional to 
j. In Particle Physics, irreps with values of j = 0,1, 2, ... and with j 
= 1/2, 3/2, ... are found that correspond to the fundamental bosons 
and fermions, respectively. 

The three dimensional orthogonal group SO(3) (det = +1) 
and the two dimensional group SU(2) have the same Lie algebra. In 
the case of the group SU(2), the (2j + _ 1)-dimensional 
representations are allowed for both integer and half-integer values 
of j, whereas, the representations of the group SO(3) are limited to 
integer values of j. Since all the representations are allowed in 
SU(2), it is called the covering group. We note that rotations 
through » and » + 2m have different effects on the 1/2-integer 
representations, and therefore they are (spinor) transformations 
associated with SU(2). 

11 
THE GROUP STRUCTURE OF LORENTZ 
TRANSFORMATIONS 

The square of the invariant interval s, between the origin 
[0, 0, 0, 0] of a spacetime coordinate system and an arbitrary event 
x" = [x°, x', x, x*] is, in index notation 

s = x"x, = x*x’, , (sum over uw = 0, 1, 2, 3). 
The lower indices can be raised using the metric tensor 
Hao dias, —1-1,—1), 
so that 


Ss = NuwX"X” = Nyy "x™ , (Sum over uw and v). 
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The vectors now have contravariant forms. 
In matrix notation, the invariant is 
=x Hx=x Hx . 
(The transpose must be written explicitly). 
The primed and unprimed column matrices (contravariant vectors) 
are related by the Lorentz matrix operator, L 
x =Lx. 
We therefore have 
x'nx = (Lx)'n (Lx) 
= x'L'nLx. 
The x’s are arbitrary, therefore 
L'nL =n. 
This is the defining property of the Lorentz transformations. 
The set of all Lorentz transformations is the set L of all 4 x 4 
matrices that satisfies the defining property 
L={L: L'nL = n; L: all 4 x 4 real matrices; 
yn = diag(1,-1,-1,-1}. 
(Note that each L has 16 (independent) real matrix elements, and 
therefore belongs to the 16-dimensional space, Ry; 
11.1 The group structure of L 
Consider the result of two _ successive Lorentz 
transformations L, and L, that transform a 4-vector x as follows 
Xx > x” 


where 


and 
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The resultant vector x” is given by 
x = L,(L,x) 
= L,L,x 
=x 
where 
L, = L,L, (L, followed by L,). 
If the combined operation L, is always a Lorentz transformation 
then it must satisfy 
Ln. =. 
We must therefore have 


(L,L,)' (L,L,) = 


or 
L,‘(L,'nL,)L, = 4 
so that 
L,'ynl,=n, (L,,L,€L) 
therefore 


L,=L,L,E€L. 
Any number of successive Lorentz transformations may be carried 
out to give a resultant that is itself a Lorentz transformation. 
If we take the determinant of the defining equation of L, 
det(L'nL) = detn 
we obtain 
(detL)’ = 1 (detL = detL’) 
so that 
detL = +1. 
Since the determinant of L is not zero, an inverse transformation 


L" exists, and the equation LL =I, the identity, is always valid. 
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Consider the inverse of the defining equation 


(LL)! = 4" 
or 
Lig '(Lty' =n". 
Using y = 7", and rearranging, gives 
L'y(L")' =n. 


This result shows that the inverse L"' is always a member of the set 
L. 

We therefore see that 

1.IfL,andL,E&Ll,thenL,L,EL 

2.If LEL ,thenL' EL 

3. The identity I= diag(1,1,1,IEL 
and 

4. The matrix operators L obey associativity. 
The set of all Lorentz transformations therefore forms a group. 
11.2 The rotation group, revisited 

Spatial rotations in two and three dimensions are Lorentz 
transformations in which the time-component remains unchanged. 

Let R be a real 3x3 matrix that is part of a Lorentz 
transformation with a constant time-component. In this case the 
defining property of the Lorentz transformations leads to 

R'R =I, the identity matrix, diag(1,1,1). 

This is the defining property of a three-dimensional orthogonal 
matrix 

If x = [X,, X,, X;] is a three-vector that is transformed under 
R to give x’ then 


xx’ = x'R'Rx 
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Sx x Sx, +e, + x, 
= invariant under R. 
The action of R on any three-vector preserves length. The set of all 
3x3 orthogonal matrices is denoted by O(3): 
O(3) = {R: R'R=L,r; 


yj 


€ R}. 
The elements of this set satisfy the four group axioms. 

The group O(3) can be split into two parts that are said to be 
disconnected: one with detR = +1 and the other with detR = -1. 


The two parts are written 


O*(3) = {R: detR = +1} 


and 


O (3) 


If we define the parity operator , P, to be the operator that 


{R: detR =-1}. 


reflects all points in a 3-dimensional cartesian system through the 


origin then 


The two parts of O(3) are related by the operator P: 
if R € O*(3) then PR € O(3), 
and 
if RC © O(3) then PR’ € O*(3). 
We can therefore consider only that part of O(3) that is a group, 
namely O*(3), together with the operator P. 
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11.3 Connected and disconnected parts of the Lorentz group 
We have shown previously, that every Lorentz 
transformation, L, has a determinant equal to +1. The matrix 
elements of L change continuously as the relative velocity changes 
continuously. It is not possible, however, to move continuously in 
such a way that we can go from the set of transformations with 
detL = +1 to those with detL = —1; the set {L: detL = +1} is 
disconnected from the set {L: detL = -1}. 
If we write the Lorentz transformation in its component form 
L>L*, 
where u = 0,1,2,3 labels the rows, and v = 0,1,2,3 labels the 
columns then the time component L°, has the values 
L°,=+1l orL, <-l. 
The set of transformations can therefore be split into four 
disconnected parts, labeled as follows: 
{L',} = {L: detL = +1, L’, = +1} 
{L' } ={L: detL = -1, L°,2 +1} 
{L',} ={L: detL = +1, L’, s-1}, 
and 
{L' } ={L: detL =-1, L®, s-1}. 
The identity is in {L',}. 
11.4 Parity, time-reversal and orthochronous transformations 
Two discrete Lorentz transformations are 
1) the parity transformation 
P={P:r—-r,t—t} 
= diag(1,-1,-1,-1), 


and 
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ii) the time-reversal transformation 
T=({T:r—r,t—-t} 
= diag(-1, 1,1, 1}. 
The disconnected parts of {L} are related by the 


transformations that involve P, T, and PT, as shown: 


PT 


tH 


Connections between the disconnected parts of Lorentz 
transformations 

The proper orthochronous transformations are in the group 
L',. It is not necessary to consider the complete set {L} of Lorentz 
transformations — we need consider only the subset {L',} that 
forms a group by itself and either P, T, or PT combined. 
Experiments have shown clear violations under the parity 
transformation, P and violations under T have been inferred from 
experiment and theory, combined. However, not a_ single 
experiment has been carried out that shows a violation of the proper 


orthochronous transformations, {L',}. 
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12 
ISOSPIN 

Particles can be distinguished from one another by their 
intrinsic properties: mass, charge, spin, parity, and their electric and 
magnetic moments. In our on-going quest for an understanding of 
the true nature of the fundamental particles, and their interactions, 
other intrinsic properties, with names such as “isospin” and 
“strangeness”, have been discovered. The intrinsic properties are 
defined by quantum numbers; for example, the quantum number a 
is defined by the eigenvalue equation 

Ao = a 
where A is a linear operator, ¢ is the wavefunction of the system in 
the zero-momentum frame, and a is an eigenvalue of A. 

In this chapter, we shall discuss the first of these new 
properties to be introduced, namely, isospin. 

The building blocks of nuclei are protons (positively 
charged) and neutrons (neutral). Numerous experiments on the 
scattering of protons by protons, and protons by neutrons, have 
shown that the nuclear forces between pairs have the same strength, 
provided the angular momentum and spin states are the same. 
These observations form the basis of an important concept — the 
charge-independence of the nucleon-nucleon force. (Corrections 
for the coulomb effects in proton-proton scattering must be made). 
The origin of this concept is found in a new symmetry principle. In 
1932, Chadwick not only identified the neutron in studying the 


interaction of alpha-particles on beryllium nuclei but also showed 
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that its mass is almost equal to the mass of the proton. (Recent 
measurements give 

mass of proton = 938-27231(28) MeV/c” 
and 

mass of neutron = 939-56563(28) MeV/c’) 
Within a few months of Chadwick’s discovery, Heisenberg 
introduced a theory of nuclear forces in which he considered the 
neutron and the proton to be two “states” of the same object — the 
nucleon. He introduced an intrinsic variable, later called isospin, 
that permits the charge states (+, 0) of the nucleons to be 
distinguished. This new variable is needed (in addition to the 
traditional space-spin variables) in the description of nucleon- 
nucleon scattering. 

In nuclei, protons and neutrons behave in a remarkably 
symmetrical way: the binding energy of a nucleus is closely 
proportional to the number of neutrons and protons, and in light 
nuclei (mass number < 40), the number of neutrons can be equal to 
the number of protons. 

Before discussing the isospin of particles and nuclei, it is 
necessary to introduce an extended Pauli Exclusion Principle. In its 
original form, the Pauli Exclusion Principle was introduced to 
account for features in the observed spectra of atoms that could not 
be understood using the then current models of atomic structure: 
no two electrons in an atom can exist in the same quantum state 


defined by the quantum numbers n, ¢, m, m, where n is the 


principal quantum number, ¢ is the orbital angular momentum 
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quantum number, m, is the magnetic quantum number, and m, is the 
spin quantum number. 

For a system of N particles, the complete wavefunction is 
written as a product of single-particle wavefunctions 

W 1, 2,..N) = wd) p(2)...p(N). 
Consider this form in the simplest case — for two identical 
particles. Let one be in a state labeled W, and the other in a state 
W,. For identical particles, it makes no difference to the probability 
density [WI of the 2-particle system if the particles are exchanged: 
M1, 2)P = RW, LP, 

(the W’s are not measurable) 
so that, either 


W2,1) = Wd, 2) (symmetric) 


or 
W2,1) =-WC, 2) (anti-symmetric). 
Let 
We = y,())y,(2) C ana, 2 in b) 
and 


Wr = y,(2)wc1) (2 ina, 1 in b). 

The two particles are indistinguishable, therefore we have no way 
of knowing whether Y, or W,, describes the system; we postulate 
that the system spends 50% of its time in WY, and 50% of its time in 
Wi. The two-particle system is considered to be a linear 
combination of P, and ;;: 

we have, therefore, either 

Boyan = (/V2){p,(Lp,(2) + p,(2)p,(1)} (BOSONS) 


or 
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(The coefficient (1/V2) normalizes the sum of the squares to be 1). 


Exchanging 1 < 2 leaves © unchanged, whereas exchanging 


symm 
particles 1<> 2 reverses the sign of Wantisymm « 

If two particles are in Wy, both particles can exist in the same 
state with a=b. If two particles are in V,, , and a =b, we have 
Ws = 0 — they cannot exist in the same quantum state. Electrons 
(fermions, spin = (1/2)h) are described by anti-symmetric 
wavefunctions. 

We can now introduce a more general Pauli Exclusion 
Principle. Write the nucleon wavefunction as a product: 

V(X, gd) = WODOn(@) . 
where 
X = xX(r, 8) 

in which r is the space vector, s is the spin, and q is a charge or 
isospin label. 

For two nucleons, we write 

P(X1, G3 Xo» Go), 


for two protons: 


Po = WiKi X2)On(Pi)On(P2), 


for two neutrons: 


Pon = Wo(K1, X)OnM)On(M3), 


and for an n-p pair: 


Pip = W3(X1> X2)On(Pi On (Ny) 


or 


= Wa(K1, Xr) On(1)On(Pr). 


107 


If we regard the proton and neutron as different states of the same 
object, labeled by the “charge or isospin coordinate”, q, we must 
extend the Pauli principle to cover the new coordinate: the total 
wavefunction is then 
(X15 Wis Xo» Io) = —VXo+ dos X1> Qi) - 

It must be anti-symmetric under the full exchange. 
For a 2p- or a 2n-pair, the exchange q,;< q, is symmetrical, and 
therefore the space-spin part must be anti-symmetrical. 

For an n-p pair, the symmetric (S) and anti-symmetric (AS) 
“isospin” wavefunctions are 

Do @s = (V2) {x(P))ON(M2) + ON(2,)ON(P2)} 

(symmetric under q, <> q,), 

and therefore the space-spin part is anti-symmetrical, 

MT) ®xs = (1V2){x(P/)ON(M2) ~ Oy, )ON(P2)} 

(anti-symmetric under q, <> q,), 

and therefore the space-spin part is symmetrical. 
We shall need these results in later discussions of the symmetric 
and anti-symmetric properties of quark systems. 
12.1 Nuclear B-decay 

Nuclei are bound states of neutrons and protons. If the 
number of protons in a nucleus is Z and the number of neutrons is N 
then the mass number of the nucleus is A = N + Z. Some nuclei are 
naturally unstable. A possible mode of decay is by the emission of 
an electron (this is B-decay — a process that typifies the 
fundamental “weak interaction’). 
We write the decay as 


AXy > “yiXn, te'+v, (B -decay) 
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or, we can have 
Ay > eX tet +, (B* - decay). 
A related process is that of electron capture of an orbital electron 
that is sufficiently close to the positively charged nucleus: 
e+ Ky Ae Xn + Vee 
Other related processes are 
+4 Xy > “7 1Xyi te* 
and 
Vet *Xy > SeXy te. 

The decay of the free proton has not been observed at the present 
time. The experimental limit on the half-life of the proton is > 10°’ 
years. Many current theories of the microstructure of matter predict 
that the proton decays. If, however, the life-time is > 10° - 10° 
years then there is no realistic possibility of observing the decay 
directly (The limit is set by Avogadro’s number and the finite 
number of protons that can be assembled in a suitable experimental 
apparatus). 

The fundamental B-decay is that of the free neutron, first 
observed in 1946. The process is 

no pi+e tv,’ , ty, = 10-37 + 0-19 
minutes. 

This measured life-time is of fundamental importance in Particle 
Physics and in Cosmology. 

Let us set up an algebraic description of the p-decay process, 
recognizing that we have a 2-state system in which the 
transformation p < n occurs: 


In the B -decay of a free neutron 
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n—> p'+e+v,, 
and in the B*-decay of a proton, bound in a nucleus, 
p ~nt+e+y,. 
12.2 Isospin of the nucleon 
The spontaneous transformations p < n observed in $-decay 


leads us to introduce the operators tT, that transform p <> n: 


T.0, = 9, T19, = O, (eliminates a 
proton) 
and 

th, = 9, 7, = O, (eliminates a 
neutron). 


Since we are dealing with a two-state system, we choose the 


“isospin” parts of the proton and neutron wavefunctions to be 


o(p) = and $(n) = 
0, 1 


in which case the operators must have the forms: 


S 


a and t= 
0 0 1 O 


They are singular and non-hermitian. We have, for example 
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and 


(v, removes a proton). 
To make the present algebraic description analogous to the two- 
state system of the intrinsic spin of the electron, we introduce linear 


combinations of the Tt, : 


0 1 
T=T+T = = O,, a Pauli matrix, 
1 O 
and 
0 -i 
T S1 = = = O,, a Pauli matrix. 
i 0 


A third (diagonal) operator is, as expected 


= O;, a Pauli matrix. 


a 
we 

I 

| 


The three operators {t,, T,, 13} therefore obey the 
commutation relations 


[v,/2,t,/2] =i€,,t//2 , 
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where the factor of(1/2) is introduced because of the 2:1 
homomorphism between SU(2) and O*(3): the vector operator 

t= 7/2 
is called the isospin operator of the nucleon. 

To classify the isospin states of the nucleon we may use the 
projection of t on the 3rd axis, t;. The eigenvalues, t,, of t, 
correspond to the proton (t, = +1/2) and neutron (t,; = —1/2) states. 
The nucleon is said to be an isospin doublet with isospin quantum 
number t = 1/2. (The number of states in the multiplet is 2t+ 1 = 2 
for t = 1/2). 

The charge, Qy of the nucleon can be written in terms of the 
isospin quantum numbers: 

Qu = a(t; +(1/2)) = gor, 
where q is the proton charge. (It is one of the major unsolved 
problems of Particle Physics to understand why the charge on the 
proton is equal to the charge on the electron). 
12.3 Isospin in nuclei. 

The concept of isospin, and of rotations in isospin space, 
associated with individual nucleons can be applied to nuclei — 
systems of many nucleons in a bound state. 

Let the isospin of the ith-nucleon be t;, and let t, = t; /2. The 
operator of a system of A nucleons is defined as 

Pes tS a2 
The eigenvalue of T; of the isospin operator T, is the sum of the 


individual components 


T; res ts = em T3,/2 


(Z—N)/2. 
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The charge, Qy of a nucleus can be written 
Qx = qdoni (t+ D/2 
= q(T; + A/2) . 

For a given eigenvalue T of the operator T, the state is (2T + 1)-fold 
degenerate. The eigenvalues T; of T, are 

T, = -T,=T +1,....0,...T + 1,T. 
If the Hamiltonian H of the nucleus is charge-independent then 

[H,T] = 0 

and T is said to be a good quantum number. In light nuclei, where 
the isospin-violating coulomb interaction between pairs of protons 
is a small effect, the concept of isospin is particularly useful. The 
study of isospin effects in nuclei was first applied to the observed 
properties of the lowest-lying states in the three nuclei with mass 
number A = 14: '"C, '*N, and '*O. The spin and parity of the ground 
state of '*C, the first excited state of '“N and the ground state of '“O 
are measured to be 0°; these three states are characterized by T = 1. 
The ground state of '"N has spin and parity 1*; it is an isospin 
singlet (T = 0). 
The relative energies of the states are shown in the following 


diagram: 


113 


Energy (MeV) 


A 
6 ie 
0* T=1,T;=1 
4 2s 
- 0* T=1,T;=0 
Ba 
0* T=1,T,;=-1 It T=0,T;=0 
gals eee 


An isospin singlet (T = 0) and an isospin triplet (T = 1) in 
the A = 14 system. In the absence of the coulomb interaction, the 


three T = 1 states would be degenerate. 


12.4 Isospin and mesons 

We have seen that it is possible to classify the charge states 
of nucleons and nuclear isobars using the concept of isospin, and 
the algebra of SU(2). It will be useful to classify other particles, 


including field particles (quanta) in terms of their isospin. 
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Yukawa (1935), first proposed that the strong nuclear force 
between a pair of nucleons is carried by massive field particles 
called mesons. 

Yukawa’s method was a masterful development of the theory of the 
electromagnetic field to include the case of a massive field particle. 
If w, is the “meson wavefunction” then the Yukawa differential 
equation for the meson is 
0,0" p, + (EX/acyy, = 0. 
where 
0,0" = (1/c*)d’/at? - V* . 
The r-dependent (spatial) form of V’ is 
V? = (1/t’)d/dr(r’d/dr) 
The static (time-independent) solution of this equation is readily 
checked to be 
Wr) = (-g’/r)exp(-t/ty) 
where 
ty = h/m,c = he/m,c’ = hc/E,’, 
so that 
ty? = (E,/acy 
The “range of the nuclear force” is defined by the condition 
r= ty = fA/m,c ~2 x10" cm. 
This gives the mass of the meson to be close to the measured value. 
It is important to note that the “range of the force” « 1/(mass of the 
field quantum). In the case of the electromagnetic field, the mass of 
the field quantum (the photon) is zero, and therefore the force has 


an infinite range. 
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The mesons come in three charge states: +, -, and 0. The 
mesons have intrinsic spins equal to zero (they are field particles 
and therefore they are bosons), and their rest energies are measured 
to be 

E,.” = 139-5 MeV and E,,” = 135-6 MeV. 
They are therefore considered to be members of an isospin triplet: 
Pelt: Sth: 
In Particle Physics, it is the custom to designate the isospin 
quantum number by I, we shall follow this convention from now on. 

The third component of the isospin is an additive quantum 
number. The combined values of the isospin projections of the two 
particles, one with isospin projection I,” , and the other with I,, is 

L¢? = 7,0 4+ 1,2. 
Their isospins combine to give states with different numbers in each 
multiplet. For example, in pion (meson)-nucleon scattering 
ma + N-— states with I," *” = (3/2) or (1/2). 
These values are obtained by noting that 
L,°? = 1, and Iy” = 1/2, so that 
1.) + by? = (41,0) + (41/2) 


3/2, an isospin quartet, or 
= 1/2, an isospin doublet. 
Symbolically, we write 
3@2 = 402. 
(This is the rule for forming the product (21,‘” + 1)@(2I1,” + 1). 
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13 

GROUPS AND THE STRUCTURE OF MATTER 
13.1 Strangeness 

In the early 1950’s, our understanding of the ultimate 
structure of matter seemed to be complete. We required neutrons, 
protons, electrons and neutrinos, and mesons and photons. Our 
optimism was short-lived. By 1953, excited states of the nucleons, 
and more massive mesons, had been discovered. Some of the new 
particles had completely unexpected properties; for example, in the 
interaction between protons and a-mesons (pions) the following 


decay mode was observed: 


Proton (p*) 
oe we ee (n° ) 


—“ ND 
AN 


1 nr 
wn. Kaon (K*) Pion (z°) 
Pion (") 
ft ft 
Initial interaction Final decay 
lasts ~10-* seconds takes ~107'° seconds 


(Strong force acting) (Weak force acting) 


Gell-Mann, and independently Nishijima, proposed that the kaons 
(heavy mesons) were endowed with a new intrinsic property not 
affected by the strong force. Gell-Mann called this property 


“strangeness”. Strangeness is conserved in the strong interactions 


1 BF 


but changes in the weak interactions. The Gell-Mann - Nishijima 
interpretation of the strangeness-changing involved in the proton- 


pion interaction is 


p* (S =0) x*(S=-) 
n° (S = 0) 
*% J K'(S=+l) * 
n* (S =0) 
n* (S = 0) 
fp fh 
AS =0 AS = | 


In the strong part of the interaction, there is no change in the 
number defining the strangeness, whereas in the weak part, the 
strangeness changes by one unit. Having defined the values of S for 
the particles in this interaction, they are defined forever. All 
subsequent experiments involving these objects have been 
consistent with the original assignments. 
13.2 Particle patterns 

In 1961, Gell-Mann, and independently Ne’eman, introduced 
a scheme that classified the strongly interacting particles into family 
groups. They were concerned with the inclusion of “strangeness” in 


their theory, and therefore they studied the arrangements of 
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particles in an abstract space defined by their electric charge and 
strangeness. The common feature of each family was chosen to be 
their intrinsic spin; the family of spin-1/2 baryons (strongly 
interacting particles) has eight members: n°, p* .>* .X° = .Z° , and 
A° . Their strangeness quantum numbers are: S = 0: n°, p* ; S =-1: 
y*,=° and A° ; and § = -2: E°~ . If the positions of these eight 
particles are given in charge-strangeness space, a remarkable 


pattern emerges: 


Strangeness 


n° p* 


There are two particles at the center, each with zero charge and 
strangeness —1; they are the ©° and the A° (They have different rest 


masses). An important set of particles consists of all baryons with 
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spin 3/2. At the time, there were nine known particles in this 
category: A°, At!, AY’, ©*°, y**!, E°, and &! . They have the 
following pattern in charge-strangeness space: 


Charge: —] 0 +1 +2 


Strangeness 


—2 


The symmetry pattern of the family of spin-3/2 baryons, shown by 
the known nine objects was sufficiently compelling for Gell-Mann, 


in 1962, to suggest that a tenth member of the family should exist. 


120 


Furthermore, if the symmetry has a physical basis, the tenth 
member should have spin-3/2, charge —1, strangeness —3, and its 
mass should be about 150MeV greater than the mass of the =° 
particle. Two years after this suggestion, the tenth member of the 
family was identified in high-energy particle collisions; it decayed 
via weak interactions, and possessed the predicted properties. This 
could not have been by chance. The discovery of the Q° particle 
was crucial in helping to establish the concept of the Gell-Mann — 
Ne’eman symmetry model. 

In addition to the symmetries of baryons, grouped by their 
spins, the model was used to obtain symmetries of mesons, also 
grouped by their spins. 

13.3 The special unitary group SU(3) and particle structure 

Several years before the work of Gell-Mann and Ne’eman, 
Sakata had attempted to build-up the known particles from 
{neutron-proton-lambda’} triplets. The lambda particle was 
required to “carry the strangeness”. Although the model was shown 
not to be valid, Ikeda et al. (1959) introduced an important 
mathematical analysis of the three-state system that involved the 
group SU(3). The notion that an underlying group structure of 
elementary particles might exist was popular in the early 1960’s. 
(Special Unitary Groups were used by J. P. Elliott in the late 1950’s 
to describe symmetry properties of light nuclei). 

The problem facing Particle Physicists, at the time, was to 
find the appropriate group and its fundamental representation, and 
to construct higher-dimensional representations that would account 


for the wide variety of symmetries observed in charge-strangeness 
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space. We have seen that the charge of a particle can be written in 
terms of its isospin, a concept that has its origin in the charge- 
independence of the nucleon-nucleon force. When appropriate, we 
shall discuss the symmetry properties of particles in isospin- 
strangeness space. 

Previously, we discussed the properties of the Lie group 
SU(2). It is a group characterized by its three generators, the Pauli 
spin matrices. Two-state systems, such as the electron with its 
quantized spin-up and spin-down, and the isospin states of nucleons 
and nuclei, can be treated quantitatively using this group. The 
symmetries of nucleon and meson families discovered by Gell- 
Mann and Ne’eman, implied an underlying structure of nucleons 
and mesons. It could not be a structure simply associated with a 
two-state system because the observed particles were endowed not 
only with positive, negative, and zero charge but also with 
strangeness. A_ three-state system was therefore considered 
necessary, at the very least; the most promising candidate was the 
group SU(3). We shall discuss the infinitesimal form of this group, 
and we shall find a suitable set of generators. 

13.3.1 The algebra of SU(3) 
The group of special unitary transformations in a 3- 
dimensional complex space is defined as 
SU(3) = {U;,; : UU" =I, detU = +1, u; © C}. 
The infinitesimal form of SU(3) is 
SU(3)ing = 1+ i60,A,/2 ,j = 1 to 8. 


(There are n’ — 1 = 8 generators). 
g 


122 


The quantities da, are real and infinitesimal, and the 3x3 matrices 
A, are the linearly independent generators of the group. The 
repeated index, j, means that a sum over j is taken. 
The defining properties of the group restrict the form of the 
generators. For example, the unitary condition is 
UU' = (1+ id0,A,/2)(1 - id0,A'/2) 
=I- id0,A,'/2 + idajA,/2 
to lst-order, 
=Tifke=A,'. 
The generators must be hermitian. 
The determinantal condition is 
det = +1; and therefore TrA,; = 0. 
The generators must be traceless. 
The finite form of U is obtained by exponentiation: 
U=exp{iaaA,/2}. 
We can find a suitable set of 8 generators by extending the 
method used in our discussion of isospin, thus: 
Let three fundamental states of the system be chosen in the 


simplest way, namely: 


1 (0) 0 
u= she=|sfme 0 
0 0 


1 
If we wish to transform v — u, we can do so by defining the 


operator A,: 


0 1 0)/0 1 
A,v=u,|0 0 0 1= 1/0 
00 0'/0 0 
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We can introduce other operators that transform the states in pairs, 


thus 


io) 
fo 4 
Se 0 Os "Or 


0 
0 
0 
0 
0 


one a) 
oO QOowWwr 


100 0 0 0 
These matrices are singular and non-hermitian. In the discussion of 
isospin and the group SU(2), the non-singular, traceless, hermitian 
matrices t,, and t, are formed from the raising and lowering 
operators t, matrices by introducing the complex linear 
combinations 

T, = T,+T_=O0,andT, = i(t, — T,) =O). 
The generators of SU(3) are formed from the operators A,, B,, C.. 
by constructing complex linear combinations. For example: 
the isospin operator tT, = O, = T,+T_,a generator of SU(2), 


becomes 


0 
o, 0 | = A,+A_ =A,,a generator of SU(3). 
0 0 0 
Continuing in this way, we obtain 


A, = 4,/2 + id,/2, 
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where 
0 
: o, 0), 
0.0. 0 
and 


C,+C. =A, C.-C. = -id,, 
B,+B_=2,,B,-B_= id. 


The remaining generators, A; and A, are traceless, diagonal, 3x3 


matrices: 
( 0 | fi <0 <0 
A,=| o, O |, A,g= |O0 1 OO}. 
00 0) 0 0 -2 
The set of matrices {A,, .....Ag} are called the Gell-Mann matrices, 


introduced in 1961. They are normalized so that 
Tr(AjA,) = 264. 


The normalized form of A, is therefore 


1 0 0 
A, = (1/3) 0 1 0 
(0 0 -2 
If we put F; = A,/2, we find 
A, =F, +iF,, 
B, =F, + iF,, 


and 
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Let A, = F;, B; = —F,/2 + (V3/4)F, , and C, = (-1/2)F, - (V3/4)F,, 
so that 
A,+B,+C,=0. 

The last condition means that only eight of the nine operators are 
independent. 

The generators of the group are readily shown to obey the 
Lie commutation relations 

[F;, F] = if, F, ,ij.k = 1 to 8. 
where the quantities fj, are the non-zero structure constants of the 
group; they are found to obey 
cae 

and the Jacobi identity. 
The commutation relations [F;, F,] can be written in terms of the 
operators A,, ...Some typical results are 

[A,, A_] = 2A,, [A,, A,] =—A,, [A_, A,] = +A_, 

[Ay;- Bs] = 03. [Ay,C,)=0;. [Bs C.]) =0 

[B,, B_] = 2B,, [B,, B,] = —B_, [B_, B,] =+B_etc. 

The two diagonal operators commute: 

[F;, F,]=0. 
The operators F,, F,, and F, contain the 2x2 isospin operators (Pauli 
matrices), each with zeros in the third row and column; they obey 
the commutation relations of isospin. We therefore make the 
identifications 
F, =I1,, F, = L, and F, =I, 

where the I,’s are the components of the isospin. 

Particles that experience the strong nuclear interaction are 


called hadrons; they are separated into two sets: the baryons with 
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half-integer spins and the mesons with zero or integer spins. 
Particles that do not experience the strong interaction are called 
leptons. In order to quantify the difference between baryons and 
leptons, it has been found necessary to introduce the baryon number 
B = +1 to denote a baryon, B = —1 to denote an anti-baryon, and 
B = O for all other particles. Leptons are characterized by the 
lepton number L = +1, anti-leptons are assigned L = —-1, and all 
other particles are assigned L = 0. It is a present-day fact, based 
upon numerous observations, that the total baryon and lepton 
number in any interaction is conserved. For example, in the decay 
of the free neutron we find 
n =ptte +v2 

B=+1=+1+0 +0 

L= 0=0 +1 +(€1). 
The fundamental symmetries in Nature responsible for these 
conservation laws are not known at this time. These conservation 
laws may, in all likelihood, be broken. 

In discussing the patterns of baryon families in charge- 
strangeness space, we wish to incorporate the fact that we are 
dealing with baryons that interact via the strong nuclear force in 
which isospin and strangeness are conserved. We therefore choose 
to describe their patterns in isospin-hypercharge space, where the 
hypercharge Y is defined to include both the strangeness and the 
baryon attribute of the particle in an additive way: 

Y=BrHs. 
The diagonal operator F, is therefore assumed to be directly 
associated with the hypercharge operator: 


F, = (V3/2)Y. 
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Because I, and Y commute, states can be chosen that are 
simultaneous eigenstates of the operators F; and F,. Since no other 
SU(3) operators commute with I, and Y, no other additive quantum 
numbers are associated with the SU(3) symmetry. The operators F,, 

. F, are considered to be new constants-of-the-motion of the 
strong interaction Hamiltonian. 
13.4 Irreducible representations of SU(3) 

In an earlier discussion of the irreducible representations of 
SU(2), we found that the commutation relations of the generators of 
the group were satisfied not only by the fundamental 2x2 matrices 
but also by matrices of higher dimension [(2J + 1) © (2J + 1)], 
where J can have the values 1/2, 1, 3/2, 2,...The J-values 
correspond to the spin of the particle whose state is given by a 
spinor (a column vector with special transformation properties). In 
the 2x2 representation, both covariant and contravariant spinors are 
allowed: 

i) covariant spinors (with lower indices) are written as 2- 


component columns that transform under U € SU(2) as 


a, = U!a,, 
where 
ay 
a=| |, 
a, 
and 
ii) contravariant spinors (with upper indices) are written as 


2-component rows that transform as: 
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8” - BU; r 


where 


B = (b',b’). 
The co- and contravariant spinors are transformed with the aid of 


the anti-symmetric tensors €;; and e'!. For example, 


B; = &; B’ 
transforms as a covariant spinor with the form 
(23 
b2 
B; = 
_p! 


The higher-dimensional representations are built up from the 
fundamental form by taking tensor products of the fundamental 
spinors a, , B! , or B, and by symmetrizing and anti-symmetrizing 
the result. We state, without proof, the theorem that is used in this 
method: 
when a tensor product of spinors has been broken down into its 
symmetric and anti-symmetric parts, it has been decomposed into 
irreducible representations of the SU(n). (See Wigner’s standard 
work for the original discussion of the method, and de Swart in 
Rev. Mod. Phys. 35, (1963) for a detailed discussion of tensor 
analysis in the study of the irreps of SU(n)) 

As an example, we write the tensor product of two covariant 


spinors 
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uw; and v;, in the following way 

U;Ov; = wiv; = (uv; + w,V,)/2 + (wiv, — p;V;)/2 
There are four elements associated with the product (i,j can have 
values | and 2). 

The symmetric part of the product has three independent 
elements, and transforms as an object that has spin J=1. (There are 
2J + 1 members of the symmetric set). The anti-symmetric part has 
one element, and therefore transforms as an object with spin J = 0. 
This result is familiar in the theory of angular momentum in 


Quantum Mechanics. The explicit forms of the four elements are: 


| J,=+1: py, 
J=1 J,= 0: (1/V2)(u,v, + pv,) 
| J,=-1: pov, 
and 
J=0 J,=0: (1/V2)(u,v,-p,Vv)). 


Higher-dimensional representations are built up from the tensor 
products of covariant and contravariant 3-spinors, a and B 
respectively. The products are then written in terms of their 
symmetric and anti-symmetric parts in order to obtain the 
irreducible representations. For example, the product 
a.B', ij = 1,2,3, can be written 
a8! = (a,B' - (1/3)8/0,8") + (1/3)5,/0,B" 

in which the trace has been separated out. The trace is a zero-rank 
tensor with a single component. The other tensor is a traceless, 
symmetric tensor with eight independent components. The 
decomposition is written symbolically as: 


3@3 = 861. 
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We can form the tensor product of two covariant 3-spinors, 

Uv; as follows: 
uv; = (1/2)(uy; + wv) + 1/2)(uyv,; — wv), iJ = 1,2,3. 
Symbolically, we have 
3@3 = 603, 
in which the symmetric tensor has six components and the anti- 
symmetric tensor has three components. 
Other tensor products that will be of interest are 
3@®3@3=1008 0801, 
and 
8®8=2701008 1098 O8 Ol. 

The appearance of the octet “8” in the 3 ® 3 decomposition 
(recall the observed octet of spin-1/2 baryons), and the decuplet 
“10” in the triple product 3 ® 3 @ 3 decomposition (recall the 
observed decuplet of spin-3/2 baryons), was of prime importance in 
the development of the group theory of “elementary” particles. 
13.4.1 Weight diagrams 

Two of the Gell-Mann matrices, A, and Ax, are diagonal. We 
can write the eigenvalue equations: 

A3U = OU, A;V = G,V, and A,w = O,,w, 
and 
A,U = BU, AsV = Biv, andA,w =B,w, 

where a, and £, are the eigenvalues. 

Let a and b be normalization factors associated with the 


operators A, and Ag, repectively, so that 
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fa 0 0 b 0 0 
rn, =|0-a O| and A,N=/0 b Ol. 
\0 0 0 \0 0 —2b 
If u=[1,0,0], v = [0, 1,0] and w = [0, 0, 1] (columns), we find 
A;\u= au, A,\u= _ bu, 
X3\v = -av, A, v= bv, 


X3\w= Ow, A,\w =—2bw. 
The weight vectors are formed from the pairs of eigenvalues: 
[x,, B.] = [a, b], 
[o,, By] = [-a, b], 
[a,,, By] = (0, -2b]. 


A weight diagram is obtained by plotting these vectors in the a—B 
space: 


This weight diagram for the fundamental “3” representation of 


SU(3) was well-known to Mathematicians at the time of the first 
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use of SU(3) symmetry in Particle Physics. It was to play a key 
role in the development of the quark model. 
13.5 The 3-quark model of matter 

Although the octet and decuplet patterns of hadrons of a 
given spin and parity emerge as irreducible representations of the 
group SU(3), major problems remained that resulted in a great deal 
of scepticism concerning the validity of the SU(3) model of 
fundamental particles. The most pressing problem was: why are 
there no known particles associated with the fundamental triplets 3, 
3 of SU(3) that exhibit the symmetry of the weight diagram 
discussed in the last section? In 1964, Gell-Mann, and 
independently, Zweig, proposed that three fundamental entities do 
exist that correspond to the base states of SU(3), and that they form 
bound states of the hadrons. That such entities have not been 
observed in the free state is related to their enormous binding 
energy. The three entities were called quarks by Gell-Mann, and 
aces by Zweig. The Gell-Mann term has survived. The anti-quarks 
are associated with the conjugate 3 representation. The three 
quarks, denoted by u, d, and s (u and d for the up- and down-isospin 


states, and s for strangeness) have highly unusual properties; they 


are 
Label B Y I I, Q=1,+vY/2 S=yY-B 
u iB> 43 io - Gio 496 0 
d Ve: 11/82 <A> ts HIB 0 
s 1/3 2/33 «OO O “2173, 2i 
s his: 2273 0 0 +1/3 +1 
d 2 218, W- eio-- is 0 
u ASM Wye <I 298 0 
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The quarks occupy the following positions in I,— Y space 


ae 


These diagrams have the same relative forms as the 3 and 3 weight 
diagrams of SU(3). 

The baryons are made up of quark triplets, and the mesons 
are made up of the simplest possible structures, namely quark—anti- 
quark pairs. The covariant and contravariant 3-spinors introduced 
in the previous section are now given physical significance: 

uw = [u, d, s], a covariant column 3- 
spinor, 
and 


u, d, s), a contravariant row 3- 


<i) 
H 
Yai, 


spinor. 
where u = [1, 0, 0], d = [0, 1, 0], and s = [0, 0, 1] represent the 
unitary symmetry part of the total wavefunctions of the three 
quarks. 

The operators A,, B,, and C, are now viewed as operators 


that transform one flavor (type) of quark into another flavor: 
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A,=Ld,) ~L+1, 
B,. = U,(U;) — U,; + 1, called the U-spin operator, and 
C, = V.(V;) = V; + 1, called the V-spin operator, where 
L(-1/2) > 1/2:d>u 
1 (41/2) ~-1/2:u-d 
U,--1/2) > 1/2:s-d 
U (41/2) ~-1/2:d—>s 
V,(-1/2) > 1/2:u—>s 
V (41/2) > -1/2:s—u. 


Quarks can be characterized by the quantum numbers I,, U;, V, 


U; -V; 


—I, 


s(0, -—%, 4) 
V; -U, 
The members of the octet of mesons with J’ = O° are formed from 


qq-pairs that belong to the fundamental 3, 3 representation of the 
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quarks. The x” and 7° mesons are linear combinations of the qq 


states, thus 


The nonet formed from the tensor product 3 ® 3 is split into an 
octet that is even under the label exchange of two particles, and a 
singlet that is odd under label exchange: 

3@3=8@1 


where the “1” is 


yn’ = (1/V3)(uu + dd + ss), 
and the two members of the octet at the center are: 


nw = (1/V2)(uu — dd) and ° = (1/V6)(uu + dd — 2ss). 
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The action of I_ on x* is to transform it into a nm’. This operation has 
the following meaning in terms of [acting on the tensor product, 
u@d: 
Lu® d) = (Lu) @d+u® (Ld) (c.f. derivative rule) 
boo 
L(m)= d @d+u@u 


=> n° 
Omitting the tensor product sign, normalizing the amplitudes, and 
choosing the phases in the generally accepted way, we have: 
nw? = (1/V2)(uu — dd). 
The singlet n° is said to be orthogonal to r° and y° at the origin. 

If the symmetry of the octet were exact, the eight members 
of the octet would have the same mass. This is not quite the case; 
the symmetry is broken by the difference in effective mass between 
the u- and d-quark (essentially the same effective masses: ~ 300 
MeV/c’) and the s-quark (effective mass ~ 500 MeV/c’). (It should 
be noted that the effective masses of the quarks, derived from the 
mass differences of hadron-pairs, is not the same as the “current- 
quark” masses that appear in the fundamental theory. The 
discrepancy between the effective masses and the fundamental 
masses is not fully understood at this time). 

The decomposition of 3 ® 3 © 3 is 

3®3®©3=(6 8 3) ©3 
= 10080801 
in which the states of the 10 are symmetric, the 1 is antisymmetric, 
and the 8, 8° states are of mixed symmetry. The decuplet that 


appears in this decomposition is associated with the observed 
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decuplet of spin-3/2 baryons. In terms of the three fundamental 
quarks — u, d, and s, the make -up of the individual members of the 


decuplet is shown schematically in the following diagram: 


ddd ~ dud ~ uud uuu 


~ dds ~ dus ~ uus 


~ sds ~ sus 


The precise make-up of each state, labeled by (Y, I, I,,) is: 


(1, 3/2, 43/2) = uuu” 

(1, 3/2,41/2) = (1/V3)(udu + duu + uud) 

(1, 3/2,-1/2) = (1/V3)(ddu + udd + dud) 

(1, 3/2, -3/2) = ddd 

(0, 1, +1) = (1/V3)(usu + suu + uus) 

(0,1, O)= (1/v6)(uds + dsu + sud + dus + sdu + usd) 
(0, 1,-1) = (1/V3)(dsd + sdd + dds) 


(-1, 1/2,+1/2) = (1/V3)(ssu + uss + sus) 
(-1,1/2,-1/2) = (1/V3)(ssd + dss + sds) 
(-2,0,0) = sss 
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The general theory of the permutation group of n entities, 
and its representations, is outside the scope of this introduction. 
The use of the Young tableaux in obtaining the mixed symmetry 
states is treated in Hamermesh (1962). 

The charges of the A**, A", and Q particles fix the fractional 
values of the quarks, namely 


quark flavor charge (in units of the electron charge) 


u +2/3 
d —1/3 
S -1/3 


The charges of the anti-quarks are opposite in sign to these values. 
Extensive reviews of the 3-quark model and its application 
to the physics of the low-energy part of the hadron spectrum can be 
found in Gasiorowicz (1966) and Gibson and Pollard (1976). 
13.6 The need for a new quantum number: hidden color 
Immediately after the introduction of the 3-quark model by 
Gell-Mann and Zweig, it was recognized that the model was not 
consistent with the extended Pauli principle when applied to bound 
states of three quarks. For example, the structure of the spin-3/2 A* 
state is such that, if each quark is assigned a spin s, = 1/2, the three 
spins must be aligned } {{ to give a net spin of 3/2. (It is assumed 
that the relative orbital angular momentum of the quarks in the A* is 
zero (a symmetric s-state) — a reasonable assumption to make, as it 
corresponds to minimum kinetic energy, and therefore to a state of 
lowest total energy). The quarks are fermions, and therefore they 
must obey the generalized Pauli Principle; they cannot exist in a 


completely aligned spin state when they are in an s-state that is 
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symmetric under particle (quark) exchange. The unitary spin 
component of the total wavefunction must be anti-symmetric. 
Greenberg (1964) proposed that a new degree of freedom must be 
assigned to the quarks if the Pauli Principle is not to be violated. 
The new property was later called “color”, a property with profound 
consequences. A quark with a certain flavor possesses color (red, 
blue, green, say) that corresponds to the triplet representation of 
another form of SU(3) — namely SU(3)., where the subscript C 
differentiates the group from that introduced by Gell-Mann and 
Zweig — the flavor group SU(3),. The anti-quarks (that possess 
anti-color) have a triplet representation in SU(3). that is the 
conjugate representation (the 3). Although the SU(3),; symmetry is 
known not to be exact, we have evidence that the SU(3). symmetry 
is an exact symmetry of Nature. Baryons and mesons are found to 
be colorless; the color singlet of a baryon occurs in the 
decomposition 

SU(3).=3®3@®3=104+8+8° +1. 
The meson singlets consist of linear combinations of the form 

1= (RR + BB + GG)3. 

Although the hadrons are colorless, certain observable 
quantities are directly related to the number of colors in the model. 
For example, the purely electromagnetic decay of the neutral pion, 
mn’, into two photons 

m= +, 
has a lifetime that is found to be closely proportion! to the square of 
the number of colors. (Adler (1970) gives T = h/t = 1(eV) (number 


of colors)’ 
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The measurements of the lifetime give a value of I[~8 eV, 


consistent with N.,,, = 3. Since these early measurements, refined 


cols 
experiments have demonstrated that there are three, and only three, 
colors associated with the quarks. 

In studies of electron-positron interactions in the GeV- 
region, the ratio of cross sections: 

R =o(e*e — hadrons)/o(e*e = u*w) 
is found to depend linearly on the number of colors. Good 
agreement between the theoretical model and the measured value of 
R, over a wide range of energy, is obtained for three colors. 

The color attribute of the quarks has been responsible for the 
development of a theory of the strongly interacting particles, called 
quantum chromodynamics. It is a field theory in which the quarks 
are generators of a new type of field — the color field. The 
mediators of the field are called gluons; they possess color, the 
attribute of the source of the field. Consequently, they can interact 
with each other through the color field. This is a field quite unlike 
the electrodynamic field of classical electromagnetism, in which the 
field quanta do not carry the attribute of the source of the field, 
namely electric charge. The photons, therefore, do not interact with 
each other. 

The gluons transform a quark of a particular color into a 
quark of a different color. For example, in the interaction between 
a red quark and a blue quark, the colors are exchanged. This 
requires that the exchanged gluon carry color and anti-color, as 


shown: 
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qb q: 


gluon, g,, carries red and anti-blue: 


q: qb 


The color lines are continuous. 


Three different colors permit nine different ways of coupling 
quarks and gluons. Three of these, red-red, blue-blue, and green- 
green do not change the colors. A linear combination ~ (RR + 
B—B + G-—G) is symmetric in the color labels, and_ this 
combination is the singlet state of the group SU(3),. Eight gluons, 
each with two color indices, are therefore required in the 3-color 
theory of quarks. 

13.7 More massive quarks 

In 1974, the results of two independent experiments one, a 
study of the reaction p + Be > e* +e ... (Ting et al.), the other a 
study of e* + e — hadrons ... (Richter et al), showed the presence 
of a sharp resonance at a center-of-mass energy of 3.1 GeV. The 
lifetime of the resonant state was found to be ~10~° seconds — 
more than 10° seconds longer than expected for a state formed in 
the strong interaction. The resonant state is called the Jp. It was 
quickly realized that the state corresponds to the ground state of a 


new quark—anti-quark system, a bound state cc, where c is a fourth, 
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massive, quark endowed with one unit of a new quantum number c, 
called “charm”. The quantum numbers assigned to the c-quark are 
JP = 1/2*,c = 1, Q/e = +2/3, and B = 1/3. 

Sound theoretical arguments for a fourth quark, carrying a 
new quantum number, had been put forward several years before 
the experimental observation of the J/Ap state. Since 1974, a 
complex set of states of the “charmonium” system has been 
observed, and their decay properties studied. Detailed comparisons 
have been made with sophisticated theoretical models of the system. 

The inclusion of a charmed quark in the set of quarks means 
that the group SU(4), must be used in place of the original Gell- 
Mann-Zweig group SU(3),. Although the SU(4),; symmetry is 
badly broken because the effective mass of the charmed quark is ~ 
1.8 GeV/c’, some useful applications have been made using the 
model. The fundamental representations are 

[u, d,s, c], a covariant column spinor, 
and 

(u, d, s, c), a contravariant row spinor. 
The irreps are constructed in a way that is analogous to that used in 
SU(3)p, namely, by finding the symmetric and anti-symmetric 
decompositions of the various tensor products. The most useful are: 

4@®4=15 01, 

4@4=1006, 

4@4© 4=20,,,, © 20, © 207, © 4 nis 

and 


15 @ 15=1@ 15,,, © 15,4; ® 20, © 45 © 45 © 84. 


sym 
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The “15” includes the non-charmed (J’ = 0° ) mesons and the 
following charmed mesons: 

D° = cu, D° = cu, mass = 1863MeV/c’, 

D* = cd, D = cd, mass = 1868 MeV/c’, 

F* =cs,F =cs, mass = 2.04 MeV/c’. 
In order to discuss the baryons, it is necessary to include the quark 
spin, and therefore the group must be extended to SU(8)-. 
Relatively few baryons have been studied in detail in this extended 
framework. 

In 1977, well-defined resonant states were observed at 
energies of 9.4, 10.01, and 10.4 GeV, and were interpreted as bound 
states of another quark, the “bottom” quark, b, and its anti-partner, 
the b. Mesons can be formed that include the b-quark, thus 

B, = bu, B,° = bd, B,° = bs, and B, = be . 
The study of the weak decay modes of these states is currently 
fashionable. 

In 1994, definitive evidence was obtained for the existence 
of a sixth quark, called the “top” quark, t. It is a massive entity 
with a mass almost 200 times the mass of the proton. 

We have seen that the quarks interact strongly via gluon 
exchange. They also take part in the weak interaction. In an earlier 
discussion of isospin, the group generators were introduced by 
considering the B-decay of the free neutron, n? > p* +e + v°. We 
now know that, at the microscopic level, this process involves the 
transformation of a d-quark into a u-quark, and the production of 


the carrier of the weak force, the massive W particle. The W~ 
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boson (spin 1) decays instantly into an electron—anti-neutrino pair, 


as shown: 
rv 
w-! aes 
d SY u i“ 
neutron, n° d(-1/3) = u(42/3) proton, p+ 
u > 
d d 
> 


The carriers of the Weak Force, W*, Z°, were first identified in p-p 


collisions at high center-of-mass energy. The processes involve 


quark—anti-quark interactions. 


e ——~> e 
u(+2/3) 7 u (-2/3) 
—> eo +— 
wt 
u(+2/3) v° d(+1/3) 
Vv 
W/m ew 
d(-1/3) u(—2/3) 
a <+—_ 


The charge is conserved at each vertex. 
The carriers have very large measured masses: 


mass W* ~ 81 GeV/c’, and mass Z° ~ 93 GeV/c’. 


145 


(Recall that the range of a force « 1/(mass of carrier); the W and Z 

masses correspond to a very short range, ~10°'* m, for the Weak 
Force). 

Any quantitative discussion of current work using Group 
Theory to tackle Grand Unified Theories, requires a knowledge of 
Quantum Field Theory that is not expected of readers of this 
introductory book. 

14 
LIE GROUPS AND THE CONSERVATION LAWS OF THE 
PHYSICAL UNIVERSE 
14.1 Poisson and Dirac Brackets 
The Poisson Bracket of two differentiable functions 


A(Pi; Pos «Pn» G1» Go» +n) 


and 
B(D1, Pos +--Pas G1» G2> +n) 

of two sets of variables (p,, p>, ...p,) and (q,, qo, ...g,) is defined as 

{A, B} = ¥," (0A/0q,)(0B/op;) — (0A/0p;)(0B/dq;) . 

If A = O(p,, g,), a dynamical variable, and 
B = H(p,, q,), the hamiltonian of a dynamical system, 

where p, is the (canonical) momentum and q, is a (generalized) 
coordinate, then 

{O, H} =>," (0O/0q;)(0OH/0p,) — (0O/0p;)(0H/0q,) . 
(n is the’number of degrees of freedom” of the system). 
Hamilton’s equations are 

OH/op, = dq,/dt and 0H/0q; = — dp,/dt , 
and therefore 


{O, H} = 5," (60/8q)(dq,/dt) + (80/Aq,)(dp,/dt) . 
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The total differential of O(p,, q;) is 
dO = >," (0O/dq;)dq; + (0O/dp;)dp,. 
and its time derivative is 


(dO/dt) = ¥ ," (80/dq,)(dq,/dt) + (8O/dp,)(dp,/dt) 


={O,H}=0O. 
If the Poisson Bracket is zero, the physical quantity O is a 
constant of the motion. 
In Quantum Mechanics, the relation 
(dO/dt) = {O, H} 
is replaced by 
(dO/dt) = -Gi/h))[O, H], 
Heisenberg’s equation of motion. It is the custom to refer to the 
commutator [O, H] as the Dirac Bracket. 

If the Dirac Bracket is zero, the quantum mechanical 
quantity O is a constant of the motion.. 

(Dirac proved that the classical Poisson Bracket {O, H} can 
be identified with the Heisenberg commutator —(i/h)[O, H] by 
making a suitable choice of the order of the q’s and p’s in the 
Poisson Bracket). 

14.2 Infinitesimal unitary transformations in Quantum 
Mechanics 
The Lie form of an infinitesimal unitary transformation is 

U=I1+ i60X/h , 
where 6a ia real infinitesimal parameter, and X is an hermitian 
operator. (It is straightforward to show that this form of U is, 


indeed, unitary). 
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Let a dynamical operator O change under an infinitesimal 
unitary transformation: 
O — O’ = UOU' 
= (I + idaX/h)OU — i6aX/h) 
= O - idaOX/h + i6aXO/f to 1st-order 
= O + i(6aXO — OdaX)/h 
= 0+ i(FO - OF)/n. 


where 
F = daX. 
The infinitesimal change in O is therefore 
s0=O°-O 
=i[F, O]/h 


If we identify F with —H6t (the classical form for a purely temporal 
change in the system) then 
60 = i[-Hdt, O]/h, 

or 

—60 = i[H, O]st/h , 
so that 

—d0/dt = if[H, O}/h. 
For a temporal change in the system, 50/dt = — dO/dt. 
The fundamental Heisenberg equation of motion 
dO/dt = i[H ,O]/h 

is therefore deduced from the unitary infinitesimal transformation 
of the operator O. 
This approach was taken by Schwinger in his formulation of 


Quantum Mechanics. 
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IF| = Hét is directly related to the generator, X, of a 
Quantum Mechanical infinitesimal transformation, and therefore we 
can associate with every symmetry transformation of the system an 
hermitian operator F that is a constant of the motion - its 
eigenvalues do not change with time. This is an example of 
Noether’s Theorem: 

A conservation law is associated with every symmetry of the 
equations of motion. If the equations of motion are unchanged by 
the transformations of a Group then a property of the system will 
remain constant as the system evolves with time. As a well-known 
example, if the equations of motion of an object are invariant under 
translations in space, the linear momentum of the system is 


conserved. 
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